(57) Abstract: Methods for diagnosing and monitoring ovarian cancer in a subject comprising measuring a plurality of kallikrein
polypeptides, and optionally CA125, or nucleic acids encoding the polypeptides in a sample from the subject. The kallikrein
polypeptides include kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and kallikrein 11.
TITLE: Multiple Marker Assay for Detection of Ovarian Cancer
FIELD OF THE INVENTION 

The invention relates to compositions, kits, and methods for detecting, characterizing, preventing, 
and treating ovarian cancer. 
BACKGROUND OF THE INVENTION

Epithelial ovarian carcinoma is the most common and most lethal of all gynecologic malignancies.
Only 30% of ovarian tumors are diagnosed at an early stage (Stage I/II), when survival rates reach 90%. The
rest are diagnosed at an advanced stage, with survival rates of less than 20% (Greenlee RT, Hill-Harmon
MB, Murray T, et al., CA Cancer J Clin. 2001;51:15-36). Currently, the only well-accepted
serological marker is CA125, a large glycoprotein of unknown function (Meyer T, Rustin GJ., Br J Cancer.
2000;82:1535-1538). However, CA125 has limitations as a diagnostic, prognostic and screening tool
(Holschneider CH, Berek JS, Semin Surg Oncol. 2000;19:3-10). Consequently, there is a need to enhance
the overall diagnostic/prognostic capability of CA125.

Kallikreins are a subgroup of secreted serine proteases, encoded by highly conserved and tightly
clustered multigene families in humans, rats and mice. The human kallikrein gene family resides on
chromosome 19q13.4 and is comprised of 15 members, whose genes are designated as KLK1 to KLK15 and
the corresponding proteins as hK1 to hK15 (Yousef GM, Diamandis EP., Endocr Rev. 2001;22:184-204;
Yousef GM, Chang A, Scorilas A, et al., Biochem Biophys Res Commun. 2000;276:125-133; Diamandis EP,
Yousef GM, Clements J, et al., Clin Chem. 2000;46:1855-1858). Kallikreins are expressed in a wide variety
of tissues and are found in many biological fluids (e.g. cerebrospinal fluid, serum, seminal plasma, milk,
etc.) where they are predicted to process specific substrates. Kallikreins may participate in cascade reactions
similar to those involved in digestion, fibrinolysis, coagulation, wound healing and apoptosis (Yousef GM,
Diamandis EP., Endocr Rev. 2001;22:184-204). Many kallikreins have been found to be differentially
expressed in endocrine-related malignancies (Diamandis EP, Yousef GM, Expert Rev Mol Diagn.
2001;1:182-190), including prostate (Barry MJ. Clinical practice, N Engl J Med. 2001;344:1373-1377;
Rittenhouse HG, Finlay JA, Mikolajczyk SD, et al., Crit Rev Clin Lab Sci. 1998;35:275-368; and Yousef
GM, Scorilas A, Jung K, et al., J Biol Chem. 2001;276:53-61), ovarian (Kim H, Scorilas A, Katsaros D, et
al., Br J Cancer. 2001;84:643-650; Anisowicz A, Sotiropoulou G, Stenman, et al., Mol Med. 1996;2:624-636;
Tanimoto H, Underwood LJ, Shigemasa K, et al., Cancer. 1999;86:2074-2082; Magklara A, Scorilas
A, Katsaros D, et al., Clin Cancer Res. 2001;7:806-811; Yousef GM, Kyriakopoulou LG, Scorilas A, et al.,
Cancer Res. 2001;61:7811-7818; Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta.
2001;306:111-118), breast (Yousef GM, Magklara A, Chang A, et al., Cancer Res. 2001;61:3425-3431;
Yousef GM, Chang A, Diamandis EP, J Biol Chem. 2000;275:11891-11898; and Yousef GM,
Magklara A, Diamandis EP, Genomics. 2000;69:331-341), and testicular cancer (Luo LY, Rajpert-De Meyts
ER, Jung K, et al., 2001;85:220-224). In addition, many kallikrein genes examined thus far are under steroid
hormone regulation, implicating a role for kallikreins in endocrine-related tissues (Yousef GM, Diamandis
EP., Endocr Rev. 2001;22:184-204). Furthermore, hK6, hK10 and hK11 have been recently identified as
novel serological ovarian cancer biomarkers (Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta.
2001;306:111-118; Diamandis EP, Yousef GM, Soosaipillai AR, Bunting P., Clin Biochem. 2000;33:579-583;
and Diamandis EP, Okui A, Mitsui S, et al., Cancer Res. 2002;62:295-300).
SUMMARY OF THE INVENTION 

The present invention seeks to overcome the drawbacks inherent in the prior art and seeks to
provide sensitive and accurate multimarker methods for the detection of ovarian cancer. A plurality of
kallikrein polypeptides and polynucleotides encoding the polypeptides, optionally in combination with
CA125 and polynucleotides encoding CA125, can have particular application in the detection of ovarian
cancer. A plurality of kallikrein markers (i.e. two or more of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein
8, kallikrein 10, and kallikrein 11) and polynucleotides encoding the polypeptides, optionally in combination
with CA125 and polynucleotides encoding CA125, constitute biomarkers for the diagnosis, monitoring,
progression, treatment, and prognosis of ovarian cancer, and they may be used as biomarkers before surgery
or after relapse.

In accordance with the methods of the invention, the presence or levels of markers in a sample can
be assessed, for example by detecting the presence in the sample of (a) polypeptides or polypeptide
fragments corresponding to the markers; (b) metabolites which are produced directly or indirectly by
polypeptides corresponding to the markers; (c) transcribed nucleic acids or fragments thereof having at least
a portion with which the markers are substantially identical; and/or (d) transcribed nucleic acids or fragments
thereof, wherein the nucleic acids hybridize with the markers.

In an aspect of the invention, a method is provided for detecting ovarian cancer in a patient
comprising detecting a plurality of kallikrein polypeptides, optionally in combination with CA125, in a
sample from the patient, wherein the method provides substantially increased sensitivity compared to
methods using CA125 alone. In an embodiment, sensitivity is increased by at least 0.5%, 1%, 2%, 3%, 4%,
5%, 10%, 15%, 20%, 25%, 30%, or 35% compared to using CA125 alone.
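
The sensitivity comparison can be illustrated with a minimal sketch. It is not the method of the invention: sensitivity is computed as the fraction of known cancer samples called positive, first with a CA125 cut-off alone and then with a panel rule. The cut-off values, the marker levels, and the "positive if any marker exceeds its cut-off" rule are all hypothetical and chosen only for illustration.

```python
def sensitivity(calls):
    """Fraction of known cancer samples that a test calls positive."""
    return sum(calls) / len(calls)

# Hypothetical cut-offs (arbitrary units); not values from the specification.
CUTOFFS = {"CA125": 35.0, "hK6": 4.0, "hK10": 1.5}

# Hypothetical marker levels for five patients with confirmed ovarian cancer.
cancer_samples = [
    {"CA125": 120.0, "hK6": 6.1, "hK10": 2.0},
    {"CA125": 20.0, "hK6": 5.2, "hK10": 1.0},
    {"CA125": 15.0, "hK6": 2.0, "hK10": 0.9},
    {"CA125": 80.0, "hK6": 3.0, "hK10": 1.1},
    {"CA125": 30.0, "hK6": 3.5, "hK10": 2.4},
]

def positive(sample, markers):
    """Assumed combination rule: positive if any listed marker exceeds its cut-off."""
    return any(sample[m] > CUTOFFS[m] for m in markers)

ca125_only = sensitivity([positive(s, ["CA125"]) for s in cancer_samples])
panel = sensitivity([positive(s, ["CA125", "hK6", "hK10"]) for s in cancer_samples])
gain = panel - ca125_only  # absolute increase in sensitivity over CA125 alone
```

An "any marker positive" rule can only raise sensitivity, but it may lower specificity, so a real evaluation would also apply the same rule to non-cancer samples.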

In an embodiment, the invention provides a method for detecting a plurality of kallikrein markers,
and optionally CA125, associated with ovarian cancer in a patient comprising:
(a) obtaining a sample from a patient;

(b) detecting or identifying in the sample kallikrein markers, optionally in combination with
CA125, wherein the kallikrein markers comprise or are selected from the group consisting
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(c) comparing the detected amounts with amounts detected for a standard.

The term "detect" or "detecting" includes assaying, assessing, imaging or otherwise establishing the
presence or absence of the target kallikrein and CA125 polypeptides or polynucleotides encoding the
polypeptides, subunits thereof, or combinations of reagent-bound targets, and the like, or assaying for,
imaging, ascertaining, establishing, or otherwise determining one or more factual characteristics of ovarian
cancer, metastasis, stage, or similar conditions. The term encompasses diagnostic, prognostic, and
monitoring applications. The kallikrein polypeptides and CA125 can be detected individually, sequentially,
or simultaneously.

According to a method involving kallikrein markers optionally in combination with CA125, the
levels in the sample of the kallikrein markers (2, 3, 4, 5, or 6) and optionally CA125, wherein the markers
comprise or are selected from kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein
11, are compared with the normal levels of the kallikrein markers, and optionally CA125, in samples of the
same type obtained from controls (e.g. samples from individuals not afflicted with ovarian cancer).
Significantly different levels in the sample of the kallikrein markers (and optionally CA125) relative to the
normal levels in a control are indicative of ovarian cancer.
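
As a rough illustration of comparing sample levels with normal levels, the sketch below flags markers whose level in a patient sample lies more than a chosen number of standard deviations from the control mean. The specification does not state how "significantly different" is determined; the z-score criterion, the threshold of 2.0, the function name `flag_markers`, and all numeric values are assumptions.

```python
from statistics import mean, stdev

def flag_markers(patient, controls, z_threshold=2.0):
    """Return markers whose patient level deviates from the control mean
    by more than z_threshold control standard deviations (assumed criterion)."""
    flagged = {}
    for marker, level in patient.items():
        values = controls[marker]
        z = (level - mean(values)) / stdev(values)
        if abs(z) > z_threshold:
            flagged[marker] = round(z, 2)
    return flagged

# Hypothetical control levels (same sample type, individuals without ovarian cancer).
controls = {
    "hK6": [2.0, 3.0, 4.0, 3.0, 3.0],
    "hK10": [1.0, 1.2, 0.8, 1.0, 1.0],
}
# Hypothetical patient levels.
patient = {"hK6": 9.0, "hK10": 1.1}

flagged = flag_markers(patient, controls)  # only hK6 exceeds the threshold here
```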
In an embodiment, the invention provides a method for diagnosing and monitoring ovarian
carcinoma in a subject comprising detecting in a sample from the subject kallikrein markers, and optionally
CA125, wherein the kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. The kallikrein markers and CA125 can
be detected using antibodies that bind to the kallikrein markers and CA125 or parts thereof.
Thus, the invention provides a method of assessing whether a patient is afflicted with or has a
predisposition for ovarian cancer, the method comprising comparing:

(a) levels of kallikrein markers, and optionally CA125, in a sample from the patient, wherein
the kallikrein markers comprise kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8,
kallikrein 10, and kallikrein 11; and
(b) normal levels of kallikrein markers, and optionally CA125, in samples of the same type
obtained from control patients not afflicted with ovarian cancer, wherein significantly
different levels of the kallikrein markers, and optionally CA125, relative to the
corresponding normal levels of the kallikrein markers, and optionally CA125, are an
indication that the patient is afflicted with ovarian cancer.
In an embodiment of a method of assessing whether a patient is afflicted with ovarian cancer (e.g.
screening, detection of a recurrence, reflex testing), the method comprises comparing:

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and
(b) normal levels of the kallikrein markers, and optionally CA125, in a control non-ovarian
cancer sample.

A significant difference between the levels of the kallikrein markers, and optionally CA125, in the
patient sample and the normal levels is an indication that the patient is afflicted with ovarian cancer.

The invention further relates to a method of assessing the efficacy of a therapy for inhibiting
ovarian cancer in a patient. This method comprises comparing:

(a) levels of kallikrein markers, and optionally CA125, in a first sample obtained from the
patient prior to providing at least a portion of the therapy to the patient, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and
(b) levels of the kallikrein markers, and optionally CA125, in a second sample obtained from
the patient following therapy.

A significant difference between the levels of the kallikrein markers, and optionally CA125, in the
second sample, relative to the first sample, is an indication that the therapy is efficacious for inhibiting
ovarian cancer.

The "therapy" may be any therapy for treating ovarian cancer including but not limited to
chemotherapy, immunotherapy, gene therapy, radiation therapy, and surgical removal of tissue. Therefore,
the method can be used to evaluate a patient before, during, and after therapy, for example, to evaluate the
reduction in tumor burden.
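
The before-and-after comparison can be sketched as a per-marker ratio of post-therapy to pre-therapy level. This is a simplification under stated assumptions: the specification speaks of a "significant difference" without fixing a test, so the 50% reduction criterion, the function names, and the marker values below are hypothetical.

```python
def fold_changes(before, after):
    """Ratio of post-therapy to pre-therapy level for each marker."""
    return {m: after[m] / before[m] for m in before}

def markers_reduced(before, after, min_reduction=0.5):
    """Markers whose level fell by at least min_reduction
    (a hypothetical 50% criterion, not taken from the specification)."""
    changes = fold_changes(before, after)
    return sorted(m for m, fc in changes.items() if fc <= 1.0 - min_reduction)

# Hypothetical levels in the first (pre-therapy) and second (post-therapy) samples.
first_sample = {"CA125": 200.0, "hK6": 8.0, "hK10": 2.0}
second_sample = {"CA125": 60.0, "hK6": 6.0, "hK10": 0.8}

reduced = markers_reduced(first_sample, second_sample)
```

The same ratio can be computed for samples taken during therapy, which fits the use of the method before, during, and after treatment.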

In an aspect, the invention provides a method for monitoring the progression of ovarian cancer in a
patient, the method comprising:

(a) detecting in a patient sample at a first time point, kallikrein markers, and optionally
CA125, wherein the kallikrein markers comprise or are selected from the group consisting
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11;

(b) repeating step (a) at a subsequent point in time; and

(c) comparing the levels detected in (a) and (b), and therefrom monitoring the progression of
ovarian cancer in the patient.
In another aspect, the invention provides a method for assessing the aggressiveness or indolence of
ovarian cancer (e.g. staging), the method comprising comparing:
(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(b) normal levels of the kallikrein markers, and optionally CA125, in a control sample.

A significant difference between the levels in the sample and the normal levels is an indication that
the cancer is aggressive or indolent.

The invention provides a method for determining whether ovarian cancer has metastasized or is
likely to metastasize in the future, the method comprising comparing:

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(b) normal levels (or non-metastatic levels) of the kallikrein markers, and optionally CA125,
in a control sample.

A significant difference between the levels in the patient sample and the normal levels is an
indication that the cancer has metastasized or is likely to metastasize in the future.
The invention also provides a method for assessing the potential efficacy of a test agent for
inhibiting ovarian cancer in a patient, and a method of selecting an agent for inhibiting ovarian cancer in a
patient.



The invention further provides a method of inhibiting ovarian cancer in a patient comprising:
(a) obtaining a sample comprising cancer cells from the patient;
(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents;

(c) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots,
wherein the kallikrein markers comprise or are selected from the group consisting of
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(d) administering to the patient at least one of the test agents which alters the levels of the
kallikrein markers, and optionally CA125, in the aliquot containing that test agent, relative
to other test agents.
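
The aliquot comparison in step (c) can be sketched as a ranking of test agents by how strongly they change marker levels. This sketch makes two assumptions that go beyond the text: it uses an untreated aliquot as a common reference, and it treats a reduction in marker level as the desirable direction (that is, markers overexpressed in the cancer cells). Agent names and values are hypothetical.

```python
def rank_agents(untreated, treated_by_agent):
    """Rank test agents by mean relative change in marker levels versus an
    untreated aliquot, largest reduction (most negative change) first."""
    scores = {}
    for agent, levels in treated_by_agent.items():
        changes = [(levels[m] - untreated[m]) / untreated[m] for m in untreated]
        scores[agent] = sum(changes) / len(changes)
    return sorted(scores, key=scores.get)

# Hypothetical marker levels measured in each aliquot.
untreated = {"hK6": 10.0, "hK10": 4.0}
treated = {
    "agent_A": {"hK6": 8.0, "hK10": 4.0},
    "agent_B": {"hK6": 5.0, "hK10": 2.0},
    "agent_C": {"hK6": 11.0, "hK10": 4.0},
}

ranking = rank_agents(untreated, treated)
```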

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test
compound comprising:

(a) maintaining separate aliquots of ovarian cells in the presence and absence of the test
compound; and

(b) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots,
wherein the markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11.

A significant difference between the levels of the kallikrein markers, and optionally CA125, in the
aliquot maintained in the presence of (or exposed to) the test compound relative to the aliquot maintained in
the absence of the test compound, indicates that the test compound possesses ovarian carcinogenic potential.

In preferred embodiments of the methods of the invention, the kallikrein markers comprise a
plurality of kallikrein markers, for example, at least three, four, five, or six of the markers. In particular, a
plurality of kallikrein markers may be selected from the group consisting of kallikrein 5, kallikrein 7,
kallikrein 8, and kallikrein 10, from the group consisting of kallikrein 7, kallikrein 8, kallikrein 10, and
kallikrein 11, or from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10
and kallikrein 11.

Other methods of the invention employ one or more polynucleotides capable of hybridizing to
polynucleotides encoding kallikrein markers, and optionally CA125. Methods for detecting polynucleotides
encoding kallikrein markers, and optionally CA125, can be used to monitor ovarian cancer by detecting the
nucleic acids.

Thus, the present invention relates to a method for diagnosing and monitoring ovarian cancer in a
sample from a subject comprising isolating nucleic acids, preferably mRNA, from the sample; and detecting
polynucleotides encoding kallikrein markers, and optionally CA125, in the sample. The presence of different
levels of polynucleotides encoding kallikrein markers, and optionally CA125, in the sample compared to a
standard or control is indicative of disease, disease stage, and/or prognosis, e.g. longer progression-free and
overall survival.

In an embodiment, the invention provides methods for determining the presence or absence of
ovarian cancer in a subject comprising (a) contacting a sample obtained from the subject with
oligonucleotides that hybridize to polynucleotides encoding kallikrein markers, and optionally CA125; and
(b) detecting in the sample levels of nucleic acids that hybridize to the polynucleotides relative to a
predetermined cut-off value, and therefrom determining the presence or absence of ovarian cancer in the
subject. Within certain embodiments, mRNA is detected via polymerase chain reaction using, for example,
oligonucleotide primers that hybridize to polynucleotides encoding kallikrein markers, and optionally
CA125, or complements of such polynucleotides. Within other embodiments, the amount of mRNA is
detected using a hybridization technique, employing oligonucleotide probes that hybridize to polynucleotides
encoding kallikrein markers, and optionally CA125, or complements of such polynucleotides.

When using mRNA detection, the method may be carried out by combining isolated mRNA with
reagents to convert to cDNA according to standard methods; treating the converted cDNA with amplification
reaction reagents (such as cDNA PCR reaction reagents) in a container along with an appropriate mixture of
nucleic acid primers; reacting the contents of the container to produce amplification products; and analyzing
the amplification products to detect the presence of polynucleotides encoding kallikrein markers, and
optionally CA125, in the sample. For mRNA the analyzing step may be accomplished using Northern Blot
analysis to detect the presence of polynucleotides encoding kallikrein markers, and optionally CA125. The
analysis step may be further accomplished by quantitatively detecting the presence of polynucleotides
encoding kallikrein markers, and optionally CA125, in the amplification product, and comparing the quantity
of markers detected against a panel of expected values for the known presence or absence of the kallikrein
markers in normal and malignant tissue derived using similar primers.

In embodiments of the methods of the invention, a plurality (e.g. three, four, five or six) of
polynucleotides encoding kallikrein polypeptides are employed. In particular, a plurality of polynucleotides
encoding kallikrein markers may be selected from the group consisting of polynucleotides encoding (i)
kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; (ii) kallikrein 7, kallikrein
8, kallikrein 10, and kallikrein 11; and (iii) kallikrein 5, kallikrein 6, kallikrein 7,
kallikrein 8, kallikrein 10 and kallikrein 11.

The invention also provides a diagnostic composition comprising a plurality of kallikrein
polypeptides and optionally CA125 polypeptide, or polynucleotides encoding the polypeptides, or agents that
bind to the polypeptides or polynucleotides.
In an embodiment, the composition comprises probes that specifically hybridize to polynucleotides
encoding kallikrein markers, and optionally CA125, or fragments thereof. In another embodiment, a
composition is provided comprising specific primer pairs capable of amplifying polynucleotides encoding
kallikrein markers, and optionally CA125, using polymerase chain reaction methodologies. In a still further
embodiment, the composition comprises agents that bind to kallikrein markers, and optionally CA125, (e.g.
antibodies) or fragments thereof. Probes, primers, and agents can be labeled with detectable substances.

In an aspect the invention provides an in vivo method comprising administering to a subject agents
that have been constructed to target kallikrein markers, and optionally CA125.

The invention therefore contemplates an in vivo method comprising administering to a mammal
imaging agents that carry labels for imaging and that bind to kallikrein markers, and optionally CA125, and
then imaging the mammal.

Still further the invention relates to therapeutic applications for ovarian cancer employing kallikrein
markers, and optionally CA125, nucleic acids encoding the polypeptides, and/or agents identified using
methods of the invention.

The invention also includes kits for carrying out methods of the invention. In an embodiment, the
kit is for assessing whether a patient is afflicted with ovarian cancer and it comprises reagents for assessing
kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or are selected from the
group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11.

In another aspect the invention relates to a kit for assessing the suitability of each of a plurality of
test compounds for inhibiting ovarian cancer in a patient. The kit comprises reagents for assessing kallikrein
markers, and optionally CA125, wherein the markers comprise or are selected from the group consisting of
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. The kit may also
comprise a plurality of test agents or compounds.

The invention contemplates a kit for assessing the presence of ovarian cancer cells, wherein the kit
comprises antibodies specific for selected kallikrein markers, and optionally CA125, wherein the markers
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8,
kallikrein 10, and kallikrein 11.

Additionally the invention provides a kit for assessing the ovarian carcinogenic potential of a test
compound. The kit comprises ovarian cells and reagents for assessing kallikrein markers, and optionally
CA125, wherein the markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6,
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11.

In an aspect the invention provides a method of treating a patient afflicted with ovarian cancer
comprising providing to cells of a patient antisense oligonucleotides complementary to polynucleotides
encoding kallikrein markers, and optionally CA125, which are overexpressed in ovarian cancer. In an
alternative method, expression of genes corresponding to kallikrein markers, and optionally CA125, which
are underexpressed in ovarian cancer is increased.

The invention relates to a method of inhibiting ovarian cancer in a patient at risk for developing
ovarian cancer comprising inhibiting or increasing expression (or overexpression) of genes encoding
kallikrein markers and optionally CA125, wherein the markers comprise or are selected from the group
consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, that are
either overexpressed or underexpressed in ovarian cancer.

Other objects, features and advantages of the present invention will become apparent from the
following detailed description. It should be understood, however, that the detailed description and the
specific examples while indicating preferred embodiments of the invention are given by way of illustration
only, since various changes and modifications within the spirit and scope of the invention will become
apparent to those skilled in the art from this detailed description.
DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in which 

Figure 1 is a graph showing hK5 concentration in serum from non-cancer and cancer patients.

Figure 2 is a graph showing hK6 concentration in serum from non-cancer and cancer patients.

Figure 3 is a graph showing hK7 concentration in serum from non-cancer and cancer patients.

Figure 4 is a graph showing hK8 concentration in serum from non-cancer and cancer patients.

Figure 5 is a graph showing hK10 concentration in serum from non-cancer and cancer patients.

Figure 6 is a graph showing hK11 concentration in serum from non-cancer and cancer patients.

Figure 7 is a graph showing CA125 concentration in serum from non-cancer and cancer patients.

Figure 8 is a ROC curve illustrating the added value of using kallikreins and CA125 together in a
multivariate function.
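
For readers unfamiliar with ROC analysis, the following minimal sketch shows how the area under a ROC curve can be compared for CA125 alone and for a combined score. The form of the multivariate function is not given here, so the linear combination, its weights, and every sample value are hypothetical. The area is computed with the standard rank formulation: the probability that a randomly chosen cancer sample scores higher than a randomly chosen control.

```python
def auc(cancer_scores, control_scores):
    """Area under the ROC curve: probability that a random cancer sample
    scores higher than a random control (ties count one half)."""
    wins = 0.0
    for c in cancer_scores:
        for n in control_scores:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(cancer_scores) * len(control_scores))

def combined_score(sample, weights):
    """Assumed linear combination of marker levels; weights are hypothetical."""
    return sum(weights[m] * sample[m] for m in weights)

# Hypothetical serum levels (arbitrary units).
cancer = [{"CA125": 100, "hK6": 6}, {"CA125": 20, "hK6": 7}, {"CA125": 30, "hK6": 2}]
control = [{"CA125": 25, "hK6": 2}, {"CA125": 10, "hK6": 3}, {"CA125": 28, "hK6": 1}]
weights = {"CA125": 0.01, "hK6": 0.5}

auc_ca125 = auc([s["CA125"] for s in cancer], [s["CA125"] for s in control])
auc_combined = auc([combined_score(s, weights) for s in cancer],
                   [combined_score(s, weights) for s in control])
```

With these made-up values the combined score separates the two groups better than CA125 alone, which is the kind of added value a ROC comparison is meant to show.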

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to newly discovered correlations between expression of certain markers and
ovarian cancer. The combinations of markers described herein may provide sensitive methods for detecting
ovarian cancer. The levels of expression of a combination of markers described herein may correlate with the
presence of ovarian cancer or a pre-malignant condition in a patient. Methods are provided for detecting the
presence of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an ovarian
cancer, the grade of an ovarian cancer, the benign or malignant nature of an ovarian cancer, the metastatic
potential of an ovarian cancer, assessing the histological type of neoplasm associated with the ovarian
cancer, the indolence or aggressiveness of the cancer, and other characteristics of ovarian cancer that are
relevant to prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient. Methods are
also provided for assessing the efficacy of one or more test agents for inhibiting ovarian cancer, assessing the
efficacy of a therapy for ovarian cancer, monitoring the progression of ovarian cancer, selecting an agent or
therapy for inhibiting ovarian cancer, treating a patient afflicted with ovarian cancer, inhibiting ovarian
cancer in a patient, and assessing the carcinogenic potential of a test compound.
Glossary 

The terms "sample", "biological sample", and the like, mean a material known or suspected of
expressing or containing a plurality of kallikrein markers or polypeptides (2, 3, 4, 5, or 6 polypeptides), and
optionally CA125 polypeptide, or polynucleotides encoding the polypeptides. The test sample can be used
directly as obtained from the source or following a pretreatment to modify the character of the sample. The
sample can be derived from any biological source, such as tissues, extracts, or cell cultures, including cells
(e.g. tumor cells), cell lysates, and physiological fluids, such as, for example, whole blood, plasma, serum,
saliva, ocular lens fluid, cerebral spinal fluid, sweat, urine, milk, ascites fluid, synovial fluid, peritoneal fluid
and the like. The sample can be obtained from animals, preferably mammals, most preferably humans. The
sample can be treated prior to use, such as preparing plasma from blood, diluting viscous fluids, and the like.
Methods of treatment can involve filtration, distillation, extraction, concentration, inactivation of interfering
components, the addition of reagents, and the like. Nucleic acids and polypeptides may be isolated from the
samples and utilized in the methods of the invention. In a preferred embodiment, the sample is a serum
sample.

The term "subject" or "patient" refers to a warm-blooded animal such as a mammal, which is
suspected of having ovarian cancer, or a condition, disease, or syndrome associated with ovarian cancer.
Preferably, "subject" refers to a human.

"CA125", "CA125 polypeptide", or "carbohydrate antigen 125" refers to a high-molecular weight
mucin, which can be defined by its ability to bind to monoclonal antibody OC125. The CA125 protein core
comprises a short cytoplasmic core tail, a transmembrane domain, and a large and heavily glycosylated
extracellular domain dominated by a repeat domain of 156 amino acids rich in serine, threonine, and proline
(Yin BW and Lloyd KO, J Biol Chem. 2001, 276:27371-27375; O'Brien TJ et al., Tumor Biol. 2001, 22:348-366;
and Hovig E. et al., Tumor Biol. 2001, 22:345-347). The sequence of CA125 is shown in GenBank
Accession No. NP_078966, AAL65133 and AF414442 (SEQ ID NO. 1). The term includes the native-sequence
polypeptides, isoforms, precursors and chimeric polypeptides. The term also includes the native
sequence polypeptide, including polypeptide variants and polypeptides with substantial sequence identity
(e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or
99% sequence identity) to the sequence of GenBank Accession No. NP_078966 (SEQ ID NO. 1), and that
preferably retain the immunogenic activity of the corresponding native sequence polypeptide.

"Kallikrein polypeptides" or "kallikrein markers" comprise kallikrein 5, kallikrein 6, kallikrein 7,
kallikrein 8, kallikrein 10, and kallikrein 11. The term includes the native-sequence polypeptides, isoforms,
precursors and chimeric polypeptides. The amino acid sequences for native kallikrein polypeptides employed
in the present invention include the sequences found in GenBank for each polypeptide as shown in Table 1,
and in SEQ ID NO: 3 (kallikrein 5), NO. 6 (kallikrein 6), NO. 10 (kallikrein 7), NO. 13 (kallikrein 8), NO.
16 (kallikrein 10), and NOs. 19 and 20 (kallikrein 11), or a portion thereof. Other useful polypeptides are
substantially identical to these sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%,
75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity), and preferably retain the immunogenic
activity of the corresponding native-sequence kallikrein polypeptide.

A "native-sequence polypeptide" comprises a polypeptide having the same amino acid sequence as
a polypeptide derived from nature. Such native-sequence polypeptides can be isolated from nature or can be
produced by recombinant or synthetic means.

The term "native-sequence polypeptide" specifically encompasses naturally occurring truncated or
secreted forms of a polypeptide, polypeptide variants including naturally occurring variant forms (e.g.,
alternatively spliced forms or splice variants), and naturally occurring allelic variants.

The term "polypeptide variant" means a polypeptide having at least about 70-80%, preferably at
least about 85%, more preferably at least about 90%, most preferably at least about 95% amino acid
sequence identity with a native-sequence polypeptide, in particular having at least 70-80%, 85%, 90%, 95%
amino acid sequence identity to the sequences identified in the GenBank Accession Nos. in Table 1 and
Accession No. NP_078966, AF414442 and AAL65133 and shown in SEQ ID NOS: 1, 2, 3, 6, 10, 13, 16, 19
and 20. Such variants include, for instance, polypeptides wherein one or more amino acid residues are
added to, or deleted from, the N- or C-terminus of the full-length or mature sequences of SEQ ID NOS: 1, 2,
3, 6, 10, 13, 16, 19 and 20, including variants from other species, but excludes a native-sequence
polypeptide.
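
The sequence identity thresholds above presuppose some way of computing percent identity. The text does not specify one, so the following is only a sketch under assumptions: the two sequences are taken as already aligned (with `-` for gaps), and identity is the share of aligned columns with matching residues, counting gap columns in the denominator. The function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns; '-' marks a gap.
    Columns where both sequences have a gap are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)

# Two short, made-up aligned peptide fragments (one substitution, one gap).
identity = percent_identity("MKTLLA-GV", "MKSLLAAGV")  # 7 of 9 columns match
meets_70_percent = identity >= 70.0
```

Other conventions (for example, excluding gap columns from the denominator) give different percentages, so the convention used should be stated alongside any threshold.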

An allelic variant may also be created by introducing substitutions, additions, or deletions into a 
nucleic acid encoding a native polypeptide sequence such that one or more amino acid substitutions, 
additions, or deletions are introduced into the encoded protein. Mutations may be introduced by standard 
methods, such as site-directed mutagenesis and PCR-mediated mutagenesis. In an embodiment, conservative 
substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which an amino acid residue is replaced with an amino acid residue with a similar 
side chain. Amino acids with similar side chains are known in the art and include amino acids with basic side 
chains (e.g. Lys, Arg, His), acidic side chains (e.g. Asp, Glu), uncharged polar side chains (e.g. Gly, Asn, 
Gln, Ser, Thr, Tyr and Cys), nonpolar side chains (e.g. Ala, Val, Leu, Ile, Pro, Trp), beta-branched side 
chains (e.g. Thr, Val, Ile), and aromatic side chains (e.g. Tyr, Phe, Trp, His). Mutations can also be 
introduced randomly along part or all of the native sequence, for example, by saturation mutagenesis. 
Following mutagenesis the variant polypeptide can be recombinantly expressed and the activity of the 
polypeptide may be determined. 
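
By way of illustration only, the side-chain families listed above can be written as a lookup that tests whether a proposed substitution is conservative. This is a minimal sketch, not part of the disclosed methods. The names SIDE_CHAIN_FAMILIES and is_conservative are hypothetical, and the sketch assumes that the uncharged polar family is Gly, Asn, Gln, Ser, Thr, Tyr and Cys and that isoleucine is abbreviated Ile.

```python
# Side-chain families using three-letter residue codes.
# Assumption: uncharged polar residues are taken as Gly, Asn, Gln, Ser, Thr,
# Tyr, Cys, and isoleucine is written "Ile".
SIDE_CHAIN_FAMILIES = {
    "basic": {"Lys", "Arg", "His"},
    "acidic": {"Asp", "Glu"},
    "uncharged_polar": {"Gly", "Asn", "Gln", "Ser", "Thr", "Tyr", "Cys"},
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Trp"},
    "beta_branched": {"Thr", "Val", "Ile"},
    "aromatic": {"Tyr", "Phe", "Trp", "His"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues share at least one side-chain family.

    An unchanged residue is not counted as a substitution.
    """
    if original == replacement:
        return False
    return any(
        original in family and replacement in family
        for family in SIDE_CHAIN_FAMILIES.values()
    )


if __name__ == "__main__":
    print(is_conservative("Lys", "Arg"))  # both basic
    print(is_conservative("Asp", "Lys"))  # acidic versus basic
```

A residue may belong to more than one family (for example, Thr is both uncharged polar and beta-branched), so the check accepts a substitution if any single family contains both residues.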



Polypeptide variants include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of a native polypeptide which include fewer amino acids than the 
full length polypeptides. A portion of a polypeptide can be a polypeptide which is, for example, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or more amino acids in length. Portions in which regions of a 
polypeptide are deleted can be prepared by recombinant techniques and can be evaluated for one or more 
functional activities such as the ability to form antibodies specific for a polypeptide. 

A naturally occurring allelic variant may contain conservative amino acid substitutions from the 
native polypeptide sequence or it may contain a substitution of an amino acid from a corresponding position 
in a CA125 or kallikrein polypeptide homolog, for example, the murine CA125 or kallikrein polypeptide. 

Percent identity of two amino acid sequences, or of two nucleic acid sequences identified herein is 
defined as the percentage of amino acid residues or nucleotides in a candidate sequence that are identical 
with the amino acid residues or nucleotides in a CA125 or kallikrein polypeptide or nucleic acid sequence, 
after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence 
identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for 
purposes of determining percent amino acid or nucleic acid sequence identity can be achieved in various 
conventional ways, for instance, using publicly available computer software including the GCG program 
package (Devereux J. et al., Nucleic Acids Research 12(1): 387, 1984); BLASTP, BLASTN, and FASTA 
(Altschul, S.F. et al. J. Molec. Biol. 215: 403-410, 1990). The BLASTX program is publicly available from 
NCBI and other sources (BLAST Manual, Altschul, S. et al. NCBI NLM NIH Bethesda, Md. 20894; Altschul, 
S. et al. J. Mol. Biol. 215: 403-410, 1990). Skilled artisans can determine appropriate parameters for 
measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of 
the sequences being compared. Methods to determine identity and similarity are codified in publicly 
available computer programs. 
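
The definition of percent identity above can be illustrated with a short calculation on two sequences that have already been aligned. This is a minimal sketch under stated assumptions, not the algorithm of any of the cited programs: the function name percent_identity is hypothetical, gaps are written as "-", and the denominator is taken to be the number of residues in the candidate sequence. Conservative substitutions count as mismatches, in keeping with the definition.

```python
def percent_identity(candidate_aln: str, reference_aln: str) -> float:
    """Percent of candidate residues identical to the aligned reference residue.

    Both inputs are rows of one pairwise alignment, so they must have equal
    length; '-' marks a gap introduced during alignment.
    """
    if len(candidate_aln) != len(reference_aln):
        raise ValueError("aligned sequences must have the same length")
    # Count only real residues of the candidate (gap columns are excluded).
    candidate_residues = sum(1 for c in candidate_aln if c != "-")
    if candidate_residues == 0:
        raise ValueError("candidate sequence contains no residues")
    matches = sum(
        1
        for c, r in zip(candidate_aln, reference_aln)
        if c != "-" and c == r
    )
    return 100.0 * matches / candidate_residues


if __name__ == "__main__":
    # Lys -> Arg is a conservative substitution but still counts as a mismatch.
    print(round(percent_identity("AKC", "ARC"), 1))
```

In practice the alignment itself would come from a program such as those cited above; the sketch covers only the final counting step.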

CA125 and kallikrein polypeptides include chimeric or fusion proteins. A "chimeric protein" or 
"fusion protein" comprises all or part (preferably biologically active) of a CA125 or kallikrein polypeptide 
operably linked to a heterologous polypeptide (i.e., a polypeptide other than the same CA125 or kallikrein 
polypeptide). Within the fusion protein, the term "operably linked" is intended to indicate that the CA125 or 
kallikrein polypeptide and the heterologous polypeptide are fused in-frame to each other. The heterologous 
polypeptide can be fused to the N-terminus or C-terminus of the CA125 or kallikrein polypeptide. A useful 
fusion protein is a GST fusion protein in which a kallikrein polypeptide is fused to the C-terminus of GST 
sequences. Another example of a fusion protein is an immunoglobulin fusion protein in which all or part of a 
CA125 or kallikrein polypeptide is fused to sequences derived from a member of the immunoglobulin 
protein family. Chimeric and fusion proteins can be produced by standard recombinant DNA techniques. 

CA125 and kallikrein polypeptides may be isolated from a variety of sources, such as from human 
tissue types or from another source, or prepared by recombinant or synthetic methods, or by any combination 
of these and similar techniques. 

"CA125 polynucleotides" or "polynucleotides encoding CA125" include nucleic acids that encode a 
native-sequence polypeptide, a polypeptide variant including a portion of a CA125 polypeptide, an isoform, 
precursor, and chimeric polypeptide. A nucleic acid sequence encoding native CA125 employed in the 
present invention includes the nucleic acid sequence in GenBank Accession No. AF414442 and SEQ ID NO. 
2, or a fragment thereof. 

"Kallikrein polynucleotides" or "polynucleotides encoding kallikrein markers/polypeptides" refers 
to kallikrein 5 nucleic acids (KLK5), kallikrein 6 nucleic acids (KLK6), kallikrein 7 nucleic acids (KLK7), 
kallikrein 8 nucleic acids (KLK8), kallikrein 10 nucleic acids (KLK10), and/or kallikrein 11 nucleic acids 
(KLK11). The term includes nucleic acids that encode a native-sequence polypeptide, a polypeptide variant 
including a portion of a kallikrein polypeptide, an isoform, precursor, and chimeric polypeptide. 

The polynucleotide sequences encoding native kallikrein polypeptides employed in the present 
invention include the nucleic acid sequences of the GenBank Accession Nos. identified in Table 1, and in 
SEQ ID NOs: 4 and 5 (KLK5), NOs. 7, 8, and 9 (KLK6), NOs. 11 and 12 (KLK7), NOs. 14 and 15 (KLK8), 
NOs. 17 and 18 (KLK10), and NOs. 21 and 22 (KLK11), or a fragment thereof. 

Polynucleotides encoding kallikrein polypeptides and CA125 include nucleic acid sequences 
complementary to these polynucleotides, and polynucleotides that are substantially identical to these 
sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 
98%, or 99% sequence identity). 

CA125 and kallikrein polynucleotides also include sequences which differ from a nucleic acid 
sequence of GenBank Accession Nos. identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 
17, 18, 21, and 22, due to degeneracy in the genetic code. As one example, DNA sequence polymorphisms 
within the nucleotide sequence encoding a CA125 or kallikrein polypeptide may result in silent mutations 
which do not affect the amino acid sequence. Variations in one or more nucleotides may exist among 
individuals within a population due to natural allelic variation. DNA sequence polymorphisms may also 
occur which lead to changes in the amino acid sequence of CA125 or a kallikrein polypeptide. 
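
The distinction between silent polymorphisms and those that change the amino acid sequence can be illustrated with the standard genetic code. The following is a minimal sketch, assuming in-frame coding DNA of equal length and the standard codon table; the function names translate and is_silent and the example sequences are hypothetical and are not taken from the sequence listing.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_dna: str) -> str:
    """Translate an in-frame coding DNA sequence into one-letter amino acids."""
    seq = coding_dna.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def is_silent(reference_dna: str, variant_dna: str) -> bool:
    """True if the DNA differs but the encoded protein is unchanged."""
    if len(reference_dna) != len(variant_dna):
        raise ValueError("sketch handles substitutions only, not indels")
    differs = reference_dna.upper() != variant_dna.upper()
    return differs and translate(reference_dna) == translate(variant_dna)


if __name__ == "__main__":
    print(is_silent("GCTGAA", "GCCGAG"))  # Ala-Glu encoded by both
    print(is_silent("GCTGAA", "GCTGAT"))  # Glu changed to Asp
```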

CA125 and kallikrein polynucleotides also include nucleic acids that hybridize under stringent 
conditions, preferably high stringency conditions, to a nucleic acid sequence of the GenBank Accession Nos. 
identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 17, 18, 21, and 22. Appropriate 
stringency conditions which promote DNA hybridization are known to those skilled in the art, or can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For 
example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 
50°C may be employed. The stringency may be selected based on the conditions used in the wash step. By 
way of example, the salt concentration in the wash step can be selected from a high stringency of about 0.2 x 
SSC at 50°C. In addition, the temperature in the wash step can be at high stringency conditions, at about 
65°C. 

CA125 and kallikrein polynucleotides also include truncated nucleic acids or fragments and variant 
forms of the polynucleotides that arise by alternative splicing of an mRNA corresponding to a DNA. 

The CA125 and kallikrein polynucleotides are intended to include DNA and RNA (e.g. mRNA) and 
can be either double stranded or single stranded. A polynucleotide may, but need not, include additional 
coding or non-coding sequences, or it may, but need not, be linked to other molecules and/or carrier or 
support materials. The polynucleotides for use in the methods of the invention may be of any length suitable 
for a particular method. 

A plurality of kallikrein polypeptides or kallikrein polynucleotides are generally detected in the 
present invention. "Plurality" refers to 2, 3, 4, 5, or 6 kallikrein polypeptides or polynucleotides, in particular 
3, 4, 5, or 6, preferably 4, 5, or 6, more preferably 5 or 6 kallikrein polypeptides or polynucleotides. 

In an embodiment a plurality of kallikrein polypeptides is selected from the group consisting of 
kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 
8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11; or kallikrein 5, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. In 
another embodiment, a plurality of kallikrein polypeptides is selected from the group consisting of kallikrein 
5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and kallikrein 11. 

In an embodiment, a plurality of kallikrein polynucleotides is selected from the group consisting of 
KLK5, KLK7, and KLK8; KLK5, KLK8 and KLK10; KLK7, KLK8 and KLK10; KLK5, KLK7, KLK8, and 
KLK10; KLK7, KLK8, KLK10 and KLK11; or KLK5, KLK7, KLK8, KLK10 and KLK11. In another 
embodiment, a plurality of kallikrein polynucleotides is selected from the group consisting of KLK5, KLK6, 
KLK7, KLK8, KLK10, and KLK11. 
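
For illustration, the notion of a "plurality" of markers drawn from the six kallikreins can be expressed as a simple membership and count check. This is a sketch only; the names KALLIKREIN_MARKERS and is_plurality are hypothetical, and the gene symbols are written KLK5, KLK6, KLK7, KLK8, KLK10 and KLK11.

```python
# The six kallikrein genes named in this specification.
KALLIKREIN_MARKERS = frozenset(
    {"KLK5", "KLK6", "KLK7", "KLK8", "KLK10", "KLK11"}
)


def is_plurality(selected) -> bool:
    """True if the selection holds 2 to 6 distinct markers from the six kallikreins.

    Raises ValueError if any selected marker is not one of the six.
    """
    chosen = set(selected)
    unknown = chosen - KALLIKREIN_MARKERS
    if unknown:
        raise ValueError(f"not a marker of this panel: {sorted(unknown)}")
    return 2 <= len(chosen) <= len(KALLIKREIN_MARKERS)


if __name__ == "__main__":
    print(is_plurality(["KLK5", "KLK7", "KLK8"]))
    print(is_plurality(["KLK5"]))
```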

General Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of ovarian 
cancer involving kallikrein polypeptides, and optionally CA125 polypeptide, and polynucleotides encoding 
the polypeptides, and the identification of subjects with a predisposition to such disorders. Such methods 
may, for example, utilize polynucleotides encoding kallikrein polypeptides, and optionally CA125, and 
fragments thereof, and binding agents (e.g. antibodies, aptamers) against kallikrein polypeptides, and 
optionally CA125 polypeptide, including peptide fragments. In particular, the polynucleotides and antibodies 
may be used, for example, for (1) the detection of either over- or under-expression of kallikrein 
polynucleotides, and optionally CA125, relative to a non-disorder state; and (2) the detection of either an 
over- or an under-abundance of kallikrein polypeptides, and optionally CA125, relative to a non-disorder 
state or the presence of modified (e.g., less than full length) kallikrein polypeptides, and optionally CA125, 
that correlate with a disorder state, or a progression toward a disorder state. 

The invention also contemplates a method for detecting ovarian cancer comprising producing a 
profile of levels of a plurality of kallikrein markers, and optionally CA125, in cells from a patient, wherein 
the markers are kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 
comparing the profile with a reference to identify a protein profile for the test cells indicative of disease. 

The methods described herein may be used to evaluate the probability of the presence of malignant 
or pre-malignant cells, for example, in a group of cells freshly removed from a host. Such methods can be 
used to detect tumors, quantitate their growth, and help in the diagnosis and prognosis of disease. The 
methods can be used to detect the presence of cancer metastasis, as well as confirm the absence or removal 
of all tumor tissue following surgery, cancer chemotherapy, and/or radiation therapy. They can further be 
used to monitor cancer chemotherapy and tumor reappearance. 

The methods described herein can be adapted for diagnosing and monitoring ovarian cancer by 
detecting a plurality of kallikrein polypeptides, and optionally CA125 polypeptide, or nucleic acids encoding 
the polypeptides in biological samples from a subject. These applications require that the amount of 
polypeptides or nucleic acids quantitated in a sample from a subject being tested be compared to a 
predetermined standard. The standard may correspond to levels quantitated for another sample or an earlier 
sample from the subject, or levels quantitated for a control sample. Levels for control samples from healthy 
subjects or ovarian cancer subjects may be established by prospective and/or retrospective statistical studies. 
Healthy or normal subjects who have no clinically evident disease or abnormalities may be selected for 
statistical studies. Diagnosis may be made by a finding of statistically different levels of a plurality of 
kallikrein polypeptides, and optionally CA125, or nucleic acids encoding same, compared to a control 
sample or previous levels quantitated for the same subject. A "significant difference" in levels of kallikrein 
markers or polynucleotides encoding the kallikrein markers in a patient sample compared to a control or 
standard (e.g. normal levels or levels in other samples from a patient) may represent levels that are higher or 
lower than the standard error of the detection assay; preferably the levels are at least about 1.5, 2, 3, 4, 5, or 6 
times higher or lower, respectively, than the control or standard. The difference in levels of markers or 
polynucleotides may be a "statistically significant difference". 
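
One of the criteria above, a level at least about 1.5 to 6 times higher or lower than the control or standard, can be sketched as a fold-difference check. The following is an illustrative sketch only: the function names and the example values are hypothetical, the levels are assumed to be positive measurements in the same units, and the check does not replace a statistical test of significance.

```python
def fold_difference(sample_level: float, control_level: float) -> float:
    """Fold difference between sample and control, always >= 1.

    A sample that is 3 times higher and one that is 3 times lower than the
    control both give 3.0.
    """
    if sample_level <= 0 or control_level <= 0:
        raise ValueError("levels must be positive")
    ratio = sample_level / control_level
    return ratio if ratio >= 1.0 else 1.0 / ratio


def differs_by_fold(
    sample_level: float, control_level: float, min_fold: float = 1.5
) -> bool:
    """True if the sample is at least min_fold times higher or lower."""
    return fold_difference(sample_level, control_level) >= min_fold


if __name__ == "__main__":
    # Hypothetical marker levels in arbitrary but matching units.
    print(differs_by_fold(3.0, 1.0, min_fold=2.0))
    print(differs_by_fold(1.2, 1.0))
```
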
Nucleic Acid Methods/Assays 

As noted herein an ovarian cancer may be detected based on the levels of polynucleotides encoding 
kallikrein polypeptides, and optionally CA125, in a sample. Techniques for detecting polynucleotides such 
as polymerase chain reaction (PCR) and hybridization assays are well known in the art. 

Nucleotide probes for use in the detection of nucleic acid sequences in samples may be constructed 
using conventional methods known in the art. Suitable probes may be based on nucleic acid sequences 
encoding at least 5 sequential amino acids from regions of nucleic acids encoding kallikrein polypeptides, 
and optionally CA125; preferably they comprise 15 to 40 nucleotides. A nucleotide probe may be labeled 
with a detectable substance such as a radioactive label that provides for an adequate signal and has sufficient 
half-life such as 32P, 3H, or the like. Other detectable substances that may be used include antigens that 
are recognized by a specific labeled antibody, fluorescent compounds, enzymes, antibodies specific for a 
labeled antigen, and luminescent compounds. An appropriate label may be selected having regard to the rate 
of hybridization and binding of the probe to the nucleotide to be detected and the amount of nucleotide 
available for hybridization. Labeled probes may be hybridized to nucleic acids on solid supports such as 
nitrocellulose filters or nylon membranes as generally described in Sambrook et al., 1989, Molecular 
Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes may be used to detect polynucleotides 
encoding kallikrein polypeptides, and optionally CA125, preferably in human cells. The nucleotide probes 
may also be useful in the diagnosis of ovarian cancer involving polynucleotides encoding kallikrein 
polypeptides, and optionally CA125; in monitoring the progression of such disorder; or monitoring a 
therapeutic treatment. 

Probes may be used in hybridization techniques to detect nucleic acids encoding a plurality of 
kallikrein polypeptides, and optionally CA125. The technique generally involves contacting and incubating 
nucleic acids (e.g. recombinant DNA molecules, cloned genes) obtained from a sample from a patient or 
other cellular source with probes under conditions favorable for the specific annealing of the probes to 
complementary sequences in the nucleic acids. After incubation, the non-annealed nucleic acids are 
removed, and the presence of nucleic acids that have hybridized to the probe, if any, is detected. 

The detection of polynucleotides encoding kallikrein polypeptides, and optionally CA125, may 
involve the amplification of specific gene sequences using an amplification method such as polymerase 
chain reaction (PCR), followed by the analysis of the amplified molecules using techniques known to those 
skilled in the art. Suitable primers can be routinely designed by one of skill in the art. 

By way of example, oligonucleotide primers may be employed in a PCR based assay to amplify a 
portion of nucleic acids encoding each of a plurality of kallikrein polypeptides, and optionally CA125, 
derived from a sample, wherein the oligonucleotide primers are specific for (i.e. hybridize to) 
polynucleotides encoding each of the plurality of kallikrein polypeptides, and optionally CA125. The 
amplified cDNA is then separated and detected using techniques well known in the art, such as gel 
electrophoresis. 

In order to maximize hybridization under assay conditions, primers and probes employed in the 
methods of the invention generally have at least about 60%, preferably at least about 75% and more 
preferably at least about 90% identity to a portion of polynucleotides encoding a plurality of kallikrein 
polypeptides, and CA125. The primers and probes may be at least 10 nucleotides, and preferably at least 20 
nucleotides in length. In an embodiment the primers and probes are about 10-40 nucleotides in 
length. 
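
The length and identity criteria above can be illustrated with a short check of a candidate primer or probe against a target sequence. This is a minimal sketch under stated assumptions: the function names and the example sequences are hypothetical, identity is computed without gaps against every same-length window of the target, and only the strand as written is examined (a reverse primer would first be converted to its reverse complement).

```python
def best_ungapped_identity(primer: str, target: str) -> float:
    """Highest percent identity of the primer to any same-length target window."""
    primer, target = primer.upper(), target.upper()
    n = len(primer)
    if n == 0 or n > len(target):
        raise ValueError("primer must be non-empty and no longer than target")
    best_matches = 0
    # Slide the primer along the target and keep the best-matching window.
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        matches = sum(p == t for p, t in zip(primer, window))
        best_matches = max(best_matches, matches)
    return 100.0 * best_matches / n


def primer_meets_criteria(
    primer: str, target: str, min_length: int = 10, min_identity: float = 60.0
) -> bool:
    """True if the primer is long enough and sufficiently identical to the target."""
    if len(primer) < min_length:
        return False
    return best_ungapped_identity(primer, target) >= min_identity


if __name__ == "__main__":
    # Hypothetical sequences for illustration; not from the sequence listing.
    target = "ATGGCTAGCTAGGATCCGATCGATCG"
    print(primer_meets_criteria("GCTAGCTAGG", target, min_identity=90.0))
```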

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of expression of polynucleotides encoding kallikrein polypeptides, and optionally 
CA125. For example, RNA may be isolated from a cell type or tissue known to express these 
polynucleotides and tested utilizing the hybridization (e.g. standard Northern analyses) or PCR techniques 
referred to herein. 

The primers and probes may be used in the above-described methods in situ, i.e. directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

In an aspect of the invention, a method is provided employing reverse transcriptase-polymerase 
chain reaction (RT-PCR), in which PCR is applied in combination with reverse transcription. Generally, 
RNA is extracted from a sample tissue using standard techniques (for example, guanidine isothiocyanate 
extraction as described by Chomczynski and Sacchi, Anal. Biochem. 162:156-159, 1987) and is reverse 
transcribed to produce cDNA. The cDNA is used as a template for a polymerase chain reaction. The cDNA 
is hybridized to sets of primers specifically designed against each of a plurality of kallikrein polynucleotide 
sequences, and optionally CA125. Once the primer and template have annealed, a DNA polymerase is 
employed to extend from the primer, to synthesize a copy of the template. The DNA strands are denatured, 
and the procedure is repeated many times until sufficient DNA is generated to allow visualization by 
ethidium bromide staining and agarose gel electrophoresis. 

Amplification may be performed on samples obtained from a subject with suspected ovarian cancer 
and an individual who is not afflicted with ovarian cancer. The reaction may be performed on several 
dilutions of cDNA spanning at least two orders of magnitude. A statistically significant difference in 
expression in several dilutions of the subject sample as compared to the same dilutions of the non-cancerous 
sample may be considered positive for the presence of ovarian cancer. 

Oligonucleotides or longer fragments derived from polynucleotides encoding each of a plurality of 
kallikrein polypeptides, and optionally CA125, may be used as targets in a microarray. The microarray can be 
used to simultaneously monitor the expression levels of large numbers of genes. The information from the 
microarray may be used to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

The preparation, use, and analysis of microarrays are well known to a person skilled in the art (See, 
for example, Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, et al. (1996) Proc. Natl. Acad. 
Sci. 93:10614-10619; Baldeschweiler et al. (1995), PCT Application WO95/251116; Shalon, D. et al. (1995) 
PCT application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, 
M. J. et al. (1997) U.S. Pat. No. 5,605,662.) 

Thus, the invention also includes an array comprising a plurality of polynucleotides encoding 
kallikrein marker(s), and optionally CA125 polynucleotides. The array can be used to assay expression of 
kallikrein polynucleotides, and optionally CA125 polynucleotides in the array. The invention allows the 
quantitation of expression of a plurality of kallikrein polynucleotides, and optionally CA125 polynucleotides. 

In an embodiment, the array can be used to monitor the time course of expression of a plurality of 
kallikrein polynucleotides, and optionally CA125 polynucleotides, in the array. This can occur in various 
biological contexts such as tumor progression. 

The array is also useful for ascertaining differential expression patterns of a plurality of kallikrein 
polynucleotides and optionally CA125 polynucleotides, in normal and abnormal cells. This provides a 
battery of polynucleotides that could serve as molecular targets for diagnosis or therapeutic intervention. 

Protein Methods 

Binding agents specific for a plurality of kallikrein markers and CA125 may be used for a variety of 
diagnostic and assay applications. There are a variety of assay formats known to the skilled artisan for using 
a binding agent to detect a target molecule in a sample. (For example, see Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988). In general, the presence or absence of an ovarian 
cancer in a subject may be determined by (a) contacting a sample from the subject with binding agents for a 
plurality of kallikrein polypeptides, and optionally CA125; (b) detecting in the sample levels of polypeptides 
that bind to the binding agents; and (c) comparing the levels of polypeptides with a predetermined standard 
or cut-off value. 
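
Step (c) above, the comparison of measured levels with a predetermined cut-off value, can be sketched as follows. This is an illustration only: the function name markers_exceeding_cutoff, the marker labels and all numeric values are hypothetical, and the sketch assumes that a level above the cut-off is the finding of interest, which is not fixed by the text.

```python
def markers_exceeding_cutoff(levels: dict, cutoffs: dict) -> list:
    """Return, in sorted order, the markers whose level is above its cut-off.

    `levels` maps marker name to the measured level; `cutoffs` maps marker
    name to its predetermined cut-off in the same units.
    """
    missing = set(levels) - set(cutoffs)
    if missing:
        raise KeyError(f"no cut-off defined for: {sorted(missing)}")
    return sorted(name for name, value in levels.items() if value > cutoffs[name])


if __name__ == "__main__":
    # Hypothetical values in arbitrary units; not data from this specification.
    measured = {"hK5": 2.0, "hK6": 1.0, "hK10": 5.5}
    cutoff = {"hK5": 1.5, "hK6": 4.0, "hK10": 5.0}
    print(markers_exceeding_cutoff(measured, cutoff))
```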

"Binding agent" refers to a substance such as a polypeptide or antibody that specifically binds to a 
kallikrein or CA125 polypeptide. A substance "specifically binds" to a polypeptide if it reacts at a detectable 
level with the kallikrein or CA125 polypeptide, and does not react detectably with peptides containing 
unrelated sequences or sequences of different polypeptides. Binding properties may be assessed using an 
ELISA, which may be readily performed by those skilled in the art (see for example, Newton et al., Develop. 
Dynamics 197: 1-13, 1993). 

A binding agent may be a ribosome, with or without a peptide component, an aptamer, an RNA 
molecule, or a polypeptide. A binding agent may be a polypeptide that comprises a kallikrein polypeptide or 
CA125 polypeptide sequence, a peptide variant thereof, or a non-peptide mimetic of such a sequence. By 
way of example a kallikrein polypeptide sequence may be a peptide portion of a kallikrein polypeptide that is 
capable of modulating a function mediated by the kallikrein polypeptide. 

An aptamer includes a DNA or RNA molecule that binds to polynucleotides and polypeptides. An 
aptamer that binds to a polypeptide (or binding domain) of a kallikrein polypeptide or a polynucleotide 
encoding a kallikrein polypeptide can be produced using conventional techniques, without undue 
experimentation. [For example, see the following publications describing in vitro selection of aptamers: Klug 
et al., Mol. Biol. Reports 20:97-107 (1994); Wallis et al., Chem. Biol. 2:543-552 (1995); Ellington, Curr. 
Biol. 4:427-429 (1994); Lato et al., Chem. Biol. 2:291-303 (1995); Conrad et al., Mol. Div. 1:69-78 (1995); 
and Uphoff et al., Curr. Opin. Struct. Biol. 6:281-287 (1996)]. 

In certain other preferred embodiments, the binding agent is an antibody. 

In an aspect the present invention provides a diagnostic method for monitoring or diagnosing 
ovarian cancer in a subject by quantitating a plurality of kallikrein polypeptides, and optionally CA125, in a 
biological sample from the subject comprising reacting the sample with antibodies specific for a plurality of 
kallikrein polypeptides, and optionally CA125, which are directly or indirectly labelled with detectable 
substances, and detecting the detectable substances. 

In an aspect of the invention, a method for detecting ovarian cancer is provided comprising: 

(a) obtaining a sample suspected of containing a plurality of kallikrein polypeptides, and 
optionally CA125, wherein the kallikrein polypeptides comprise or are selected from the 
group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and 
kallikrein 11; 

(b) contacting the sample with antibodies that specifically bind to the plurality of kallikrein 
polypeptides, and optionally CA125, under conditions effective to bind the antibodies and 
form complexes; 

(c) measuring the amount of kallikrein polypeptides, and optionally CA125, present in the 
sample by quantitating the amount of the complexes; and 

(d) comparing the amount of kallikrein polypeptides, and optionally CA125, present in the 
sample with the amount of polypeptides in a control, wherein a change or significant 
difference in the amount of polypeptides in the sample compared with the amount in the 
control is indicative of ovarian cancer. 

In an embodiment, the invention contemplates a method for monitoring the progression of ovarian 
cancer in an individual, comprising: 

(a) contacting antibodies which bind to each of a plurality of kallikrein polypeptides, and 
optionally CA125, with a sample from the individual so as to form binary complexes 
comprising each of the antibodies and polypeptides in the sample; 

(b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 

(d) comparing the result of step (b) with the result of step (c), wherein a difference in the 
amount of complex formation is indicative of the stage and/or progression of the ovarian 
cancer in said individual. 

The amount of complexes may also be compared to a value representative of the amount of the 
complexes from an individual not at risk of, or afflicted with, ovarian cancer at different stages. 

Thus, antibodies specifically reactive with each of a plurality of kallikrein polypeptides, and 
CA125, or derivatives, such as enzyme conjugates or labeled derivatives, may be used to detect a plurality of 
kallikrein polypeptides, and optionally CA125, in various samples (e.g. biological materials). They may be 
used as diagnostic or prognostic reagents and they may be used to detect abnormalities in the levels of 
expression of a plurality of kallikrein polypeptides, and optionally CA125, or abnormalities in the structure, 
and/or temporal, tissue, cellular, or subcellular location of a plurality of kallikrein polypeptides, and 
optionally CA125. Antibodies may also be used to screen potentially therapeutic compounds in vitro to 
determine their effects on ovarian cancer involving a plurality of kallikrein polypeptides, and optionally 
CA125, and other conditions. In vitro immunoassays may also be used to assess or monitor the efficacy of 
particular therapies. 

Antibodies may be used in any known immunoassays that rely on the binding interaction between 
antigenic determinants of a plurality of kallikrein polypeptides, and optionally CA125, and the antibodies. 
Examples of such assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA), 
immunofluorescence, immunoprecipitation, latex agglutination, hemagglutination, and histochemical tests. 
These terms are well understood by those skilled in the art. A person skilled in the art will know, or can 
readily discern, other immunoassay formats without undue experimentation. 

In particular, the antibodies may be used in immunohistochemical analyses, for example, at the 
cellular and subcellular level, to detect a plurality of kallikrein polypeptides, and optionally CA125, to 
localize them to particular ovarian tumor cells and tissues, and to specific subcellular locations, and to 
quantitate the level of expression. 

Antibodies for use in the present invention include monoclonal or polyclonal antibodies, 
immunologically active fragments (e.g. Fab or (Fab)2 fragments), antibody heavy chains, humanized 
antibodies, antibody light chains, genetically engineered single chain Fv molecules (Ladner et al., U.S. Pat. 
No. 4,946,778), chimeric antibodies, for example, antibodies which contain the binding specificity of murine 
antibodies, but in which the remaining portions are of human origin, or derivatives, such as enzyme 
conjugates or labeled derivatives. 

Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, may be 
prepared using methods known to those skilled in the art. Isolated native or recombinant kallikrein 
polypeptides or CA125 may be utilized to prepare antibodies. See, for example, Kohler et al. (1975) Nature 
256:495-497; Kozbor et al. (1985) J. Immunol Methods 81:31-42; Cote et al. (1983) Proc Natl Acad Sci 
80:2026-2030; and Cole et al. (1984) Mol Cell Biol 62:109-120 for the preparation of monoclonal 
antibodies; Huse et al. (1989) Science 246:1275-1281 for the preparation of monoclonal Fab fragments; and 
Pound (1998) Immunochemical Protocols, Humana Press, Totowa, N.J. for the preparation of phagemid or B- 
lymphocyte immunoglobulin libraries to identify antibodies. The antibodies specific for kallikrein 
polypeptides or CA125 used in the methods of the invention may also be obtained from scientific or 
commercial sources. 

In an embodiment of the invention, antibodies are reactive against kallikrein polypeptides or CA125 
if they bind with a Kq of greater than or equal to 10'^ M. 

Antibodies that bind to kallikrein polypeptides or CA125 may be labelled with a detectable 
substance and localised in biological samples based upon the presence of the detectable substance. Examples 
of detectable substances include, but are not limited to, the following: radioisotopes (e.g., 3H, 14C, 35S), 
fluorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), luminescent labels such as luminol, 
enzymatic labels (e.g., horseradish peroxidase, beta-galactosidase, luciferase, alkaline phosphatase, 
acetylcholinesterase), biotinyl groups (which can be detected by marked avidin e.g., streptavidin containing a 
fluorescent marker or enzymatic activity that can be detected by optical or colorimetric methods), and 
predetermined polypeptide epitopes recognized by a secondary reporter (e.g., leucine zipper pair sequences, 
binding sites for secondary antibodies, metal binding domains, epitope tags). In some embodiments, labels 
are attached via spacer arms of various lengths to reduce potential steric hindrance. Antibodies may also be 
coupled to electron dense substances, such as ferritin or colloidal gold, which are readily visualised by 
electron microscopy. 

Indirect methods may also be employed in which the primary antigen-antibody reaction is amplified 
by the introduction of a second antibody, having specificity for the antibody reactive against a kallikrein 
polypeptide or CA125. The second antibody may be labeled with a detectable substance to detect the 
primary antigen-antibody reaction. By way of example, if the antibody having specificity against a kallikrein
polypeptide is a rabbit IgG antibody, the second antibody may be goat anti-rabbit gamma-globulin labelled 
with a detectable substance as described herein. 

Methods for conjugating or labelling the antibodies discussed above may be readily accomplished
by one of ordinary skill in the art. (See, for example, Inman, Methods In Enzymology, Vol. 34, Affinity
Techniques, Enzyme Purification: Part B, Jakoby and Wilchek (eds.), Academic Press, New York, p. 30,
1974; and Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical Applications," Anal. Biochem.
171:1-32, 1988, regarding methods for conjugating or labelling the antibodies with an enzyme or ligand
binding partner.)

Cytochemical techniques known in the art for localizing antigens using light and electron 
microscopy may be used to detect a plurality of kallikrein polypeptides, and optionally CA125. Generally,
antibodies may be labeled with detectable substances and kallikrein polypeptides, and optionally CA125,
may be localised in tissues and cells based upon the presence of the detectable substance. 

In the context of the methods of the invention, the sample, binding agents (e.g. antibodies) for a 
plurality of kallikrein polypeptides, and CA125 may be immobilized on a carrier or support. Examples of 
suitable carriers or supports are agarose, cellulose, nitrocellulose, dextran, Sephadex, Sepharose, liposomes,
carboxymethyl cellulose, polyacrylamides, polystyrene, gabbros, filter paper, magnetite, ion-exchange resin,
plastic film, plastic tube, glass, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid copolymer,
ethylene-maleic acid copolymer, nylon, silk, etc. The support material may have any possible configuration
including spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external surface of
a rod), or flat (e.g. sheet, test strip). Thus, the carrier may be in the shape of, for example, a tube, test plate,
well, beads, disc, sphere, etc. The immobilized material may be prepared by reacting the material with a
suitable insoluble carrier using known chemical or physical methods, for example, cyanogen bromide
coupling. Binding agents (e.g. antibodies) may be indirectly immobilized using second binding agents
specific for the first binding agent. For example, mouse antibodies specific for a kallikrein polypeptide may
be immobilized using sheep anti-mouse IgG Fc fragment specific antibody coated on the carrier or support.

Where radioactive labels are used as a detectable substance, a plurality of kallikrein polypeptides,
and optionally CA125, may be localized by radioautography. The results of radioautography may be
quantitated by determining the density of particles in the radioautographs by various optical methods, or by
counting the grains.

Time-resolved fluorometry may be used to detect a signal. For example, the method described in
Christopoulos TK and Diamandis EP, Anal Chem 1992;64:342-346, may be used with a conventional
time-resolved fluorometer.

Therefore, in accordance with an embodiment of the invention, a method is provided wherein
antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, are labelled with
enzymes, substrates for the enzymes are added wherein the substrates are selected so that the substrates, or a
reaction product of the enzymes and substrates, form fluorescent complexes with lanthanide metals.
Lanthanide metals are added and the plurality of kallikrein polypeptides, and optionally CA125, are
quantitated in the sample by measuring fluorescence of the fluorescent complexes. Antibodies specific for
CA125 and each of a plurality of kallikrein polypeptides may be directly or indirectly labelled with enzymes.
Enzymes are selected based on the ability of a substrate of the enzyme, or a reaction product of the enzyme
and substrate, to complex with lanthanide metals such as europium and terbium. Examples of suitable
enzymes include alkaline phosphatase and beta-galactosidase.

Examples of enzymes and substrates for enzymes that provide such fluorescent complexes are
described in U.S. Patent No. 5,312,922 to Diamandis. By way of example, when the antibody is directly or
indirectly labelled with alkaline phosphatase, the substrate employed in the method may be
4-methylumbelliferyl phosphate, 5-fluorosalicyl phosphate, or diflunisal phosphate. The fluorescence intensity
of the complexes is typically measured using a time-resolved fluorometer, e.g. a CyberFluor 615
Immunoanalyzer (Nordion International, Kanata, Ontario).

Antibodies specific for a plurality of kallikrein polypeptides and CA125 may also be indirectly
labelled with enzymes. For example, an antibody may be conjugated to one partner of a ligand binding pair,
and the enzyme may be coupled to the other partner of the ligand binding pair. Representative examples
include avidin-biotin, and riboflavin-riboflavin binding protein. In another embodiment, antibodies specific
for the anti-kallikrein antibodies or anti-CA125 antibodies are labeled with an enzyme.

In accordance with an embodiment, the present invention provides means for determining a
plurality of kallikrein polypeptides, and optionally CA125, in a sample, in particular a serum sample, by
measuring a plurality of kallikrein polypeptides, and optionally CA125, by immunoassay. It will be evident
to a skilled artisan that a variety of immunoassay methods can be used to measure a plurality of kallikrein
polypeptides and CA125 in serum. In general, an immunoassay method may be competitive or
noncompetitive. Competitive methods typically employ immobilized or immobilizable antibodies to each of
a plurality of kallikrein polypeptides, and optionally CA125, and a labeled form of each of a plurality of
kallikrein polypeptides, and optionally CA125. Kallikrein polypeptides and CA125 in the sample and labeled
kallikrein polypeptides and CA125 compete for binding to anti-kallikrein antibodies and anti-CA125
antibodies. After separation of the resulting labeled kallikrein polypeptides and CA125 that have become
bound to anti-kallikrein antibodies and anti-CA125 antibodies (bound fraction) from that which has remained
unbound (unbound fraction), the amount of the label in either bound or unbound fraction is measured and
may be correlated with the amount of kallikrein polypeptides, and optionally CA125, in the test sample in
any conventional manner, e.g., by comparison to a standard curve.
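
By way of illustration only, the comparison to a standard curve mentioned above may be sketched in Python as follows. The calibrator values, the units, and the function name are hypothetical and are not taken from this specification; the sketch assumes a competitive format, in which the bound-label signal decreases as the analyte concentration increases, and interpolates linearly in log concentration between adjacent calibrators.

```python
import math


def concentration_from_signal(signal, standards):
    """Interpolate an analyte concentration from a competitive-assay standard curve.

    standards: list of (concentration, bound_label_signal) calibrator pairs.
    In a competitive format the bound-label signal falls as the unlabeled
    analyte concentration rises, so the curve is monotonically decreasing.
    """
    pts = sorted(standards)  # ascending concentration
    for (c_lo, s_lo), (c_hi, s_hi) in zip(pts, pts[1:]):
        # The lower-concentration calibrator carries the higher signal.
        if s_hi <= signal <= s_lo:
            if s_lo == s_hi:
                return c_lo
            frac = (s_lo - signal) / (s_lo - s_hi)
            # Interpolate linearly in log10(concentration).
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("signal outside the calibrated range")


# Hypothetical calibrators: (concentration in ug/L, bound-label counts).
STANDARDS = [(0.1, 9000.0), (1.0, 6000.0), (10.0, 3000.0), (100.0, 1000.0)]
```

A signal that falls outside the calibrated range raises an error rather than being extrapolated, which is a deliberately conservative choice for this sketch.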
In an aspect, a non-competitive method is used for the determination of a plurality of kallikrein
polypeptides, and optionally CA125, with the most common method being the "sandwich" method. In this
assay, two types of antibodies specific for each of a plurality of kallikrein polypeptides, and optionally
CA125, are employed. One type of antibody is directly or indirectly labeled (sometimes referred to as the
"detection antibody") and the other is immobilized or immobilizable (sometimes referred to as the "capture
antibody"). The capture and detection antibodies can be contacted simultaneously or sequentially with a test
sample. Sequential methods can be accomplished by incubating capture antibodies with the sample, and
adding the detection antibodies at a predetermined time thereafter (sometimes referred to as the "forward"
method); or the detection antibodies can be incubated with the sample first and then the capture antibodies
added (sometimes referred to as the "reverse" method). After the necessary incubation(s) have occurred, to
complete the assay, the capture antibodies are separated from the liquid test mixture, and labels are measured
in at least a portion of the separated capture antibody phase or the remainder of the liquid test mixture.
Generally the labels are measured in the capture antibody phase since it comprises kallikrein polypeptides,
and optionally CA125, bound by ("sandwiched" between) the capture and detection antibodies. In an
embodiment, the label may be measured without separating the capture antibodies and liquid test mixture.

In a typical two-site immunometric assay for a plurality of kallikrein polypeptides, and optionally
CA125, one or both of the capture and detection antibodies are polyclonal antibodies or one or both of the
capture and detection antibodies are monoclonal antibodies (i.e. polyclonal/polyclonal,
monoclonal/monoclonal, or monoclonal/polyclonal). The labels used with the detection antibodies can be
selected from any of those known conventionally in the art. The label may be an enzyme or a
chemiluminescent moiety, but it can also be a radioactive isotope, a fluorophor, a detectable ligand (e.g.,
detectable by a secondary binding by a labeled binding partner for the ligand), and the like. Preferably
antibodies are labelled with enzymes which are detected by adding substrates that are selected so that a
reaction product of the enzymes and substrates forms fluorescent complexes. Capture antibodies may be
selected so that they provide a means for being separated from the remainder of the test mixture.
Accordingly, the capture antibodies can be introduced to the assay in an already immobilized or insoluble
form, or can be in an immobilizable form, that is, a form which enables immobilization to be accomplished
subsequent to introduction of the capture antibodies to the assay. An immobilized capture antibody may
comprise an antibody covalently or noncovalently attached to a solid phase such as a magnetic particle, a
latex particle, a microtiter plate well, a bead, a cuvette, or other reaction vessel. An example of an
immobilizable capture antibody is an antibody which has been chemically modified with a ligand moiety, e.g., a
hapten, biotin, or the like, and which can be subsequently immobilized by contact with an immobilized form
of a binding partner for the ligand, e.g., an antibody, avidin, or the like. In an embodiment, a capture
antibody may be immobilized using a species specific antibody for the capture antibody that is bound to the
solid phase.

A particular sandwich immunoassay method of the invention employs two types of antibodies: first
antibodies reactive against each of a plurality of kallikrein polypeptides, and optionally CA125, and
second antibodies having specificity against the antibodies reactive against each of a plurality of kallikrein
polypeptides, and optionally CA125, labelled with enzymatic labels; and fluorogenic substrates for the
enzymes. The enzyme may be alkaline phosphatase (ALP) and the substrate 5-fluorosalicyl phosphate.
ALP cleaves phosphate out of the fluorogenic substrate, 5-fluorosalicyl phosphate, to produce
5-fluorosalicylic acid (FSA). 5-Fluorosalicylic acid can then form a highly fluorescent ternary complex of the
form FSA-Tb(3+)-EDTA, which can be quantified by measuring the Tb3+ fluorescence in a time-resolved
mode. Fluorescence intensity is measured using a time-resolved fluorometer as described herein.

The above-described immunoassay methods and formats are intended to be exemplary and are not
limiting.

Computer Systems 

Computer readable media comprising a plurality of kallikrein markers, and optionally CA125, are
also provided. "Computer readable media" refers to any medium that can be read and accessed directly by a
computer, including but not limited to magnetic storage media, such as floppy discs, hard disc storage
medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM
and ROM; and hybrids of these categories such as magnetic/optical storage media. Thus, the invention
contemplates computer readable medium having recorded thereon markers identified for patients and
controls.

"Recorded" refers to a process for storing information on computer readable medium. The skilled
artisan can readily adopt any of the presently known methods for recording information on computer
readable medium to generate manufactures comprising information on a plurality of kallikrein markers, and
optionally CA125.

A variety of data processor programs and formats can be used to store information on a plurality of
kallikrein markers, and optionally CA125, on computer readable medium. For example, the information can
be represented in a word processing text file, formatted in commercially-available software such as
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database
application, such as DB2, Sybase, Oracle, or the like. Any number of data processor structuring formats (e.g.,
text file or database) may be adapted in order to obtain computer readable medium having recorded thereon
the marker information.
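
As a non-limiting sketch of one such structuring format, marker information could be recorded in a small relational table. The example below uses SQLite from the Python standard library purely for convenience; it stands in for the database applications named above. The table name, column names, marker abbreviations, and subject identifiers are all hypothetical.

```python
import sqlite3

# Hypothetical marker names for the six kallikreins and CA125.
MARKERS = ("hK5", "hK6", "hK7", "hK8", "hK10", "hK11", "CA125")


def create_store(conn):
    """Create the (hypothetical) table holding one level per subject and marker."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS marker_levels (
               subject_id TEXT NOT NULL,
               status     TEXT NOT NULL,  -- e.g. 'patient' or 'control'
               marker     TEXT NOT NULL,
               level      REAL NOT NULL,
               PRIMARY KEY (subject_id, marker))"""
    )


def record_levels(conn, subject_id, status, levels):
    """Record measured marker levels for one subject, replacing earlier values."""
    unknown = set(levels) - set(MARKERS)
    if unknown:
        raise ValueError(f"unknown markers: {sorted(unknown)}")
    rows = [(subject_id, status, m, float(v)) for m, v in levels.items()]
    conn.executemany("INSERT OR REPLACE INTO marker_levels VALUES (?, ?, ?, ?)", rows)
    conn.commit()


def load_levels(conn, subject_id):
    """Return {marker: level} for one subject."""
    cur = conn.execute(
        "SELECT marker, level FROM marker_levels WHERE subject_id = ?", (subject_id,)
    )
    return dict(cur.fetchall())
```

An in-memory database (`sqlite3.connect(":memory:")`) is sufficient to try the sketch; a file path would be used to persist the records.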

By providing the marker information in computer readable form, one can routinely access the 
information for a variety of purposes. For example, one skilled in the art can use the information in computer 
readable form to compare marker information obtained during or following therapy with the information
stored within the data storage means. 
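
For instance, a comparison of follow-up marker information with stored information might be expressed as a per-marker ratio. The following Python sketch is illustrative only; the function name, marker names, and values are hypothetical, and the specification does not prescribe any particular comparison.

```python
def compare_to_baseline(baseline, follow_up):
    """Return the follow-up/baseline ratio for each marker present in both records.

    baseline, follow_up: {marker_name: level} dictionaries (hypothetical layout).
    Markers that are missing from either record, or whose baseline is not
    positive, are skipped rather than guessed at.
    """
    ratios = {}
    for marker, base in baseline.items():
        if marker in follow_up and base > 0:
            ratios[marker] = follow_up[marker] / base
    return ratios
```

A ratio below 1 indicates a lower level at follow-up than in the stored record; how such a change is interpreted is outside the scope of this sketch.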
The invention provides a medium for holding instructions for performing a method for determining
whether a patient has ovarian cancer or a pre-disposition to ovarian cancer, comprising determining the
presence or absence of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding
same, and based on the presence or absence of the plurality of kallikrein markers, optionally CA125, and/or
polynucleotides encoding same, determining whether the patient has ovarian cancer or a pre-disposition to
ovarian cancer, and optionally recommending treatment for the ovarian cancer or pre-ovarian cancer
condition.

The invention also provides in an electronic system and/or in a network, a method for determining
whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of
kallikrein markers, and optionally CA125, and/or polynucleotides encoding same, comprising determining
the presence or absence of a plurality of kallikrein markers, and optionally CA125, and/or polynucleotides
encoding same, and based on the presence or absence of the plurality of kallikrein markers, and optionally
CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian cancer or a
pre-disposition to ovarian cancer, and optionally recommending treatment for the ovarian cancer or
pre-ovarian cancer condition.

The invention further provides in a network, a method for determining whether a subject has
ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of kallikrein markers,
optionally CA125, and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information
on the subject and information on a plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same, associated with samples from the subject; (b) acquiring information from the network
corresponding to the plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding
same; (c) based on the phenotypic information and information on the plurality of kallikrein markers,
optionally CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian
cancer or pre-ovarian cancer condition.

The invention still further provides a system for identifying selected records that identify an ovarian
cancer cell. A system of the invention generally comprises a digital computer; a database server coupled to
the computer; a database coupled to the database server having data stored therein, the data comprising
records of data comprising a plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same; and a code mechanism for applying queries based upon desired selection criteria to the data
file in the database to produce reports of records which match the desired selection criteria.

In an aspect of the invention a method is provided for detecting an ovarian cancer cell using a 
computer having a processor, memory, display, and input/output devices, the method comprising the steps 
of: 

(a) creating records of a plurality of kallikrein markers, optionally CA125, and/or 
polynucleotides encoding same, isolated from a sample suspected of containing an ovarian 
cancer cell; 

(b) providing a database comprising records of data comprising a plurality of kallikrein 
markers, optionally CA125, wherein the markers are kallikrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11, and/or comprising polynucleotides encoding 
same; and 

(c) using a code mechanism for applying queries based upon desired selection criteria to the
data file in the database to produce reports of records of step (a) which provide a match of
the desired selection criteria of the database of step (b), the presence of a match being a
positive indication that the markers of step (a) have been isolated from a cell that is an
ovarian cancer cell.
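
Steps (a) to (c) above can be pictured with a minimal Python sketch of a query that applies selection criteria to marker records. The record layout, the marker abbreviations, and the numeric cutoffs are hypothetical placeholders chosen only to make the example executable; they are not diagnostic thresholds and do not appear in this specification.

```python
def matching_records(records, criteria):
    """Return the records whose marker levels meet every selection criterion.

    records:  list of {"id": str, "levels": {marker: level}} dictionaries.
    criteria: {marker: minimum_level}; a record matches only if each listed
              marker is present and at or above its (hypothetical) cutoff.
    """
    matches = []
    for record in records:
        levels = record["levels"]
        if all(m in levels and levels[m] >= cutoff for m, cutoff in criteria.items()):
            matches.append(record)
    return matches
```

A record lacking a marker named in the criteria is treated as a non-match in this sketch, so that missing data never produces a positive indication.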

The invention contemplates a business method for determining whether a subject has ovarian cancer
or a pre-disposition to ovarian cancer associated with a plurality of kallikrein markers, optionally CA125,
and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information on the subject and
information on a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same,
associated with samples from the subject; (b) acquiring information from a network corresponding to the
plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same; (c) based on
the phenotypic information, information on a plurality of kallikrein markers, optionally CA125, and/or
polynucleotides encoding same, and acquired information, determining whether the subject has ovarian
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian
cancer or pre-ovarian cancer condition.
Imaging Methods 

Antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, may
also be used in imaging methodologies in the management of ovarian cancer. The invention provides a
method for imaging tumors associated with a plurality of kallikrein polypeptides, and optionally CA125.

In an embodiment the method is an in vivo method and a subject or patient is administered imaging 
agents that carry imaging labels and are capable of targeting or binding to each of a plurality of kallikrein 
polypeptides, and optionally CA125. In the method each imaging agent is labeled so that it can be 
distinguished during the imaging. The imaging agents are allowed to incubate in vivo and bind to the
plurality of kallikrein polypeptides, and optionally CA125, associated with ovarian tumors. The presence of 
label is localized to the ovarian cancer, and the localized label is detected using imaging devices known to 
those skilled in the art. 

The imaging agents may be antibodies or chemical entities that recognize the plurality of kallikrein
polypeptides, and optionally CA125. In an aspect of the invention an imaging agent is a polyclonal antibody
or monoclonal antibody, or fragments thereof, or constructs thereof including but not limited to, single chain
antibodies, bifunctional antibodies, molecular recognition units, and peptides or entities that mimic peptides.
The antibodies specific for kallikrein polypeptides and CA125 used in the methods of the invention may be
obtained from scientific or commercial sources, or isolated native or recombinant kallikrein and CA125
polypeptides may be utilized to prepare antibodies etc. as described herein.

An imaging agent may be a peptide that mimics the epitope for an antibody specific for kallikrein 
polypeptide or CA125 and binds to kallikrein polypeptide or CA125. The peptide may be produced on a 
commercial synthesizer using conventional solid phase chemistry. By way of example, a peptide may be 
prepared that includes either tyrosine, lysine, or phenylalanine to which N2S2 chelate is complexed (See U.S.
Patent No. 4,897,255). The anti-kallikrein peptide conjugate is then combined with a radiolabel (e.g. sodium
99mTc pertechnetate or sodium 188Re perrhenate) and it may be used to locate a tumor producing a plurality
of kallikrein polypeptides, and optionally CA125.

Imaging agents carry labels to image the plurality of kallikrein polypeptides and CA125. Agents
may be labelled for use in radionuclide imaging. In particular, agents may be directly or indirectly labelled
with a radioisotope. Examples of radioisotopes that may be used in the present invention are the following:
227Ac, 211At, 128Ba, 131Ba, 7Be, 204Bi, 205Bi, 206Bi, 76Br, 77Br, 82Br, 109Cd, 47Ca, 11C, 14C, 36Cl, 48Cr,
51Cr, 62Cu, 64Cu, 67Cu, 165Dy, 155Eu, 18F, 153Gd, 66Ga, 67Ga, 68Ga, 72Ga, 198Au, 3H, 166Ho, 111In,
113mIn, 115mIn, 123I, 125I, 189Ir, 191mIr, 192Ir, 194Ir, 52Fe, 55Fe, 59Fe, 177Lu, 15O, 191m-191Os, 109Pd,
32P, 42K, 226Ra, 186Re, 188Re, 82mRb, 153Sm, 46Sc, 47Sc, 72Se, 75Se, 105Ag, 22Na, 24Na, 89Sr, 35S, 38S,
177Ta, 96Tc, 99mTc, 201Tl, 202Tl, 113Sn, 117mSn, 121Sn, 166Yb, 169Yb, 175Yb, 88Y, 90Y, 62Zn and 65Zn.
Preferably the radioisotope is 131I, 99mTc, 90Y, 186Re, 188Re, 32P, 153Sm, 67Ga, 201Tl, 77Br, or 18F, and
it is imaged with a photoscanning device.

Procedures for labeling biological agents with the radioactive isotopes are generally known in the
art. U.S. Pat. No. 4,302,438 describes tritium labeling procedures. Procedures for iodinating, tritium labeling,
and 35S labeling especially adapted for murine monoclonal antibodies are described by Goding, J. W. (supra,
pp 124-126) and the references cited therein. Other procedures for iodinating biological agents, such as
antibodies, binding portions thereof, probes, or ligands, are described in the scientific literature (see Hunter
and Greenwood, Nature 144:945 (1962), David et al., Biochemistry 13:1014-1021 (1974), and U.S. Pat. Nos.
3,867,517 and 4,376,110). Iodinating procedures for agents are described by Greenwood, F. et al., Biochem.
J. 89:114-123 (1963); Marchalonis, J., Biochem. J. 113:299-305 (1969); and Morrison, M. et al.,
Immunochemistry, 289-297 (1971). 99mTc-labeling procedures are described by Rhodes, B. et al. in
Burchiel, S. et al. (eds.), Tumor Imaging: The Radioimmunochemical Detection of Cancer, New York:
Masson 111-123 (1982) and the references cited therein. Labelling of antibodies or fragments with
technetium-99m is also described, for example, in U.S. Pat. No. 5,317,091, U.S. Pat. No. 4,478,815, U.S.
Pat. No. 4,478,818, U.S. Pat. No. 4,472,371, U.S. Pat. No. Re 32,417, and U.S. Pat. No. 4,311,688.
Procedures suitable for 111In-labeling biological agents are described by Hnatowich, D. J. et al., J. Immunol.
Methods, 65:147-157 (1983), Hnatowich, D. et al., J. Applied Radiation, 35:554-557 (1984), and Buckley,
R. G. et al., F.E.B.S. 166:202-204 (1984).

An imaging agent may also be labeled with a paramagnetic isotope for purposes of an in vivo
method of the invention. Examples of elements that are useful in magnetic resonance imaging include
gadolinium, terbium, tin, iron, or isotopes thereof. (See, for example, Schaefer et al., (1989) JACC 14,
472-480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G. L., (1984) Physiol. Chem. Phys. Med.
NMR 16, 93-95; Wesbey et al., (1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al., (1984)
Invest. Radiol. 19, 408-415 for discussions on in vivo nuclear magnetic resonance imaging.)

In the case of radiolabeled agents, the agents may be administered to the patient, localized to the
tumor having a plurality of kallikrein polypeptides, and optionally CA125, with which the agents bind, and
detected or "imaged" in vivo using known techniques such as radionuclear scanning using, for example, a
gamma camera or emission tomography. [See, for example, A. R. Bradwell et al., "Developments in
Antibody Imaging", Monoclonal Antibodies for Cancer Detection and Therapy, R. W. Baldwin et al. (eds.),
pp. 65-85 (Academic Press 1985)]. A positron emission transaxial tomography scanner, such as that designated
Pet VI located at Brookhaven National Laboratory, can also be used where the radiolabel emits positrons
(e.g., 11C, 18F, 15O, and 13N).

Whole body imaging techniques using radioisotope labeled agents can be used for locating both
primary tumors and tumors which have metastasized. Antibodies specific for a plurality of kallikrein
polypeptides, and optionally CA125, or fragments thereof having the same epitope specificity, are bound to a
suitable radioisotope, or a combination thereof, and administered parenterally. For ovarian cancer,
administration preferably is intravenous. The bio-distribution of the labels can be monitored by scintigraphy,
and accumulations of the labels can be related to the presence of ovarian cancer cells. Whole body imaging
techniques are described in U.S. Pat. Nos. 4,036,945 and 4,311,688. Other examples of agents useful for
diagnosis and therapeutic use that can be coupled to antibodies and antibody fragments include
metallothionein and fragments (see U.S. Pat. No. 4,732,864). These agents are useful in diagnosis, staging
and visualization of cancer, in particular ovarian cancer, so that surgical and/or radiation treatment protocols
can be used more efficiently.

Screening Methods

The invention also contemplates methods for evaluating test agents or compounds for their ability to
inhibit ovarian cancer or potentially contribute to ovarian cancer. Test agents and compounds include but
are not limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random
peptide libraries and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration
amino acids, phosphopeptides (including members of random or partially degenerate, directed
phosphopeptide libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric,
single chain antibodies, fragments (e.g. Fab, F(ab)2, and Fab expression library fragments, and
epitope-binding fragments thereof)], nucleic acids (e.g. antisense, interference RNA) and small organic or
inorganic molecules. The agents or compounds may be endogenous physiological compounds or natural or
synthetic compounds.

The invention also provides a method for assessing the potential efficacy of a test agent for
inhibiting ovarian cancer in a patient, the method comprising comparing:

(a) levels of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same, in a first sample obtained from a patient and exposed to the test agent,
wherein the markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and

(b) levels of the plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same, in a second sample obtained from the patient, wherein the sample is not
exposed to the test agent, wherein a significant difference in the levels of expression of a
plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same,
in the first sample, relative to the second sample, is an indication that the test agent is
potentially efficacious for inhibiting ovarian cancer in the patient.

The first and second samples may be portions of a single sample obtained from a patient or portions
of pooled samples obtained from a patient.

In an aspect, the invention provides a method of selecting an agent for inhibiting ovarian cancer in a
patient comprising:

(a) obtaining a sample comprising cancer cells from the patient;

(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents;

(c) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same, in each of the aliquots, wherein the markers comprise or are selected from
the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10,
and kallikrein 11; and

(d) selecting one of the test agents which alters the levels of the kallikrein markers, optionally
CA125, and/or polynucleotides encoding same, in the aliquot containing that test agent,
relative to other test agents.

Still another aspect of the present invention provides a method of conducting a drug discovery
business comprising:

(a) providing one or more methods or assay systems for identifying agents that inhibit ovarian
cancer in a patient;

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof,
for efficacy and toxicity in animals; and

(c) formulating a pharmaceutical preparation including one or more agents identified in step
(b) as having an acceptable therapeutic profile.

In certain embodiments, the subject method can also include a step of establishing a distribution
system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a
sales group for marketing the pharmaceutical preparation.

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test
compound comprising:

(a) maintaining separate aliquots of ovarian cells in the presence and absence of the test
compound; and

(b) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides
encoding same, in each of the aliquots, wherein the markers comprise or are selected from
the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10,
and kallikrein 11.

A significant difference between the levels of the markers in the aliquot maintained in the presence
of (or exposed to) the test compound relative to the aliquot maintained in the absence of the test compound
indicates that the test compound possesses ovarian carcinogenic potential.
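
The comparison of marker levels between exposed and unexposed aliquots may be sketched, purely by way of example, in Python as follows. The specification does not define how a "significant difference" is to be established; the sketch substitutes a simple fold-change rule on replicate means, and the function name, the default two-fold threshold, and the marker names are assumptions made only for illustration.

```python
from statistics import mean


def changed_markers(exposed, unexposed, min_fold=2.0):
    """Flag markers whose mean level differs by at least min_fold between aliquots.

    exposed, unexposed: {marker: [replicate levels]} for aliquots kept with and
    without the test compound. min_fold is a hypothetical threshold standing in
    for whatever significance test is actually used.
    Returns {marker: exposed_mean / unexposed_mean} for the flagged markers.
    """
    flagged = {}
    for marker in exposed.keys() & unexposed.keys():
        a = mean(exposed[marker])
        b = mean(unexposed[marker])
        if a <= 0 or b <= 0:
            continue  # a ratio is undefined or meaningless for non-positive means
        fold = a / b
        if fold >= min_fold or fold <= 1.0 / min_fold:
            flagged[marker] = fold
    return flagged
```

In practice a statistical test on the replicates would normally replace the fixed threshold; the fold-change rule is used here only because it keeps the example self-contained.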
Kits 

The methods described herein may be performed by utilizing pre-packaged diagnostic kits
comprising at least a plurality of kallikrein nucleic acids or binding agents (e.g. antibodies) or CA125 nucleic
acids or binding agents described herein, which may be conveniently used, e.g., in clinical settings, to screen
and diagnose patients, and to screen and identify those individuals afflicted with or exhibiting a
predisposition to ovarian cancer.

35 Thus, the invention also contemplates kits for carrying out the methods of the invention. Such kits 

typically comprise two or more components required for performing a diagnostic assay. Components include 
but are not limited to compounds, reagents, containers, and/or equipment. 

In an embodiment, a container with a kit comprises binding agents as described herein. By way of
example, the kit may contain antibodies specific for a plurality of kallikrein polypeptides, and optionally
CA125; antibodies against those antibodies, labelled with enzymes; and substrates for the enzymes. The kit may
also contain microtiter plate wells, standards, assay diluent, wash buffer, adhesive plate covers, and/or
instructions for carrying out a method of the invention using the kit.

In an aspect of the invention, the kit includes antibodies or antibody fragments which bind
specifically to epitopes of each of a plurality of kallikrein polypeptides, and optionally CA125, and means
for detecting binding of the antibodies to epitopes associated with tumor cells, either as concentrates
(including lyophilized compositions), which may be further diluted prior to use or at the concentration of
use, where the vials may include one or more dosages. Where the kits are intended for in vivo use, single
dosages may be provided in sterilized containers, having the desired amount and concentration of agents.
Containers that provide a formulation for direct use usually do not require other reagents, as for example,
where the kit contains radiolabelled antibody preparations for in vivo imaging.

A kit may be designed to detect the level of polynucleotides encoding kallikrein polypeptides, and
optionally CA125 polynucleotides, in a sample. Such kits generally comprise oligonucleotide probes or
primers, as described herein, that hybridize to a plurality of polynucleotides encoding kallikrein polypeptides
and optionally CA125. Such oligonucleotides may be used, for example, within a PCR or hybridization
procedure. Additional components that may be present within the kits include second oligonucleotides and/or
diagnostic reagents to facilitate detection of a plurality of polynucleotides encoding kallikrein polypeptides, and
optionally CA125 polynucleotides.

The reagents suitable for applying the screening methods of the invention to evaluate compounds
may be packaged into convenient kits described herein providing the necessary materials packaged into
suitable containers.
Applications 

Kallikrein polypeptides (in particular, kallikrein 5, 6, 10 and 11), optionally in combination with
CA125, are targets for ovarian cancer immunotherapy. Such immunotherapeutic methods include the use of
antibody therapy, in vivo vaccines, and ex vivo immunotherapy approaches.

In one aspect, the invention provides antibodies specific for a plurality of kallikrein polypeptides
(for example, kallikreins 5, 6, 10 and 11) and optionally CA125, that may be used systemically to treat
ovarian cancer. Preferably antibodies are used that target the tumor cells but not the surrounding non-tumor
cells and tissue. Thus, the invention provides a method of treating a patient susceptible to, or having a cancer
that expresses a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to the
patient an effective amount of antibodies that bind specifically to a plurality of kallikrein polypeptides, and
optionally CA125. In another aspect, the invention provides a method of inhibiting the growth of tumor cells
expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to a
patient antibodies which bind specifically to the plurality of kallikrein polypeptides, and optionally CA125,
in amounts effective to inhibit growth of the tumor cells. Antibodies specific for a plurality of kallikrein
polypeptides, and optionally CA125, may also be used in a method for selectively inhibiting the growth of,
or killing a cell expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising reacting
antibody immunoconjugates or immunotoxins with the cell in an amount sufficient to inhibit the growth of,
or kill the cell.

By way of example, unconjugated antibodies specific for a plurality of kallikrein polypeptides, and
optionally CA125, may be introduced into a patient such that the antibodies bind to cancer cells expressing a
plurality of kallikrein polypeptides, and optionally CA125, and mediate growth inhibition of such cells
(including the destruction thereof), and the tumor, by mechanisms which may include complement-mediated
cytolysis, antibody-dependent cellular cytotoxicity, altering the physiologic function of a plurality of
kallikrein polypeptides, and optionally CA125, and/or the inhibition of ligand binding or signal transduction
pathways. In addition to unconjugated antibodies, antibodies specific for a plurality of kallikrein
polypeptides, and optionally CA125, conjugated to therapeutic agents (e.g. immunoconjugates) may also be
used therapeutically to deliver the agents directly to tumor cells expressing a plurality of kallikrein
polypeptides, and optionally CA125, and thereby destroy the tumor. Examples of such agents include abrin,
ricin A, Pseudomonas exotoxin, or diphtheria toxin; proteins such as tumor necrosis factor, alpha-interferon,
beta-interferon, nerve growth factor, platelet derived growth factor, tissue plasminogen activator; and
biological response modifiers such as lymphokines, interleukin-1, interleukin-2, interleukin-6, granulocyte
macrophage colony stimulating factor, granulocyte colony stimulating factor, or other growth factors.

Cancer immunotherapy using antibodies specific for a plurality of kallikrein polypeptides, and
optionally CA125, may utilize the various approaches that have been successfully employed for cancers,
including but not limited to colon cancer (Arlen et al., 1998, Crit Rev Immunol 18: 133-138), multiple
myeloma (Ozaki et al., 1997, Blood 90: 3179-3186; Tsunenari et al., 1997, Blood 90: 2437-2444), gastric
cancer (Kasprzyk et al., 1992, Cancer Res 52: 2771-2776), B-cell lymphoma (Funakoshi et al., 1996, J
Immunother Emphasis Tumor Immunol 19: 93-101), leukemia (Zhong et al., 1996, Leuk Res 20: 581-589),
colorectal cancer (Moun et al., 1994, Cancer Res 54: 6160-6166; Velders et al., 1995, Cancer Res 55: 4398-
4403), and breast cancer (Shepard et al., 1991, J Clin Immunol 11: 117-127).

In the practice of a method of the invention, antibodies specific for a plurality of kallikrein
polypeptides, optionally in combination with antibodies specific for CA125, capable of inhibiting the growth
of cancer cells expressing a plurality of kallikrein polypeptides, and optionally CA125, are administered in a
therapeutically effective amount to cancer patients whose tumors express or overexpress a plurality of
kallikrein polypeptides, and optionally CA125. The invention may provide a specific, effective and
long-needed treatment for ovarian cancer. The antibody therapy methods of the invention may be combined with
other therapies including chemotherapy and radiation.

Patients may be evaluated for the presence and level of expression and overexpression of a plurality of
kallikrein polypeptides, and optionally CA125, in tumors, preferably using immunohistochemical
assessments of tumor tissue, quantitative imaging as described herein, or other techniques capable of reliably
indicating the presence and degree of expression of a plurality of kallikrein polypeptides, and optionally
CA125. Immunohistochemical analysis of tumor biopsies or surgical specimens may be employed for this
purpose.

Antibodies specific for a plurality of kallikrein polypeptides and CA125 useful in treating cancer
include those that are capable of initiating a potent immune response against the tumor and those that are
capable of direct cytotoxicity. In this regard, the antibodies may elicit tumor cell lysis by either complement-
mediated or antibody-dependent cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc
portion of the immunoglobulin molecule for interaction with effector cell Fc receptor sites or complement
proteins. In addition, antibodies specific for a plurality of kallikrein polypeptides and CA125 that exert a
direct biological effect on tumor growth are useful in the practice of the invention. Such antibodies may not
require the complete immunoglobulin to exert the effect. Potential mechanisms by which such directly
cytotoxic antibodies may act include inhibition of cell growth, modulation of cellular differentiation,
modulation of tumor angiogenesis factor profiles, and the induction of apoptosis. The mechanism by which a
particular antibody exerts an anti-tumor effect may be evaluated using any number of in vitro assays
designed to determine ADCC, antibody-dependent macrophage-mediated cytotoxicity (ADMMC),
complement-mediated cell lysis, and others known in the art.

The anti-tumor activity of a combination of antibodies specific for a plurality of kallikrein
polypeptides and optionally CA125, may be evaluated in vivo using a suitable animal model. Xenogenic
cancer models, wherein human cancer explants or passaged xenograft tissues are introduced into immune
compromised animals, such as nude or SCID mice, may be employed.

The methods of the invention contemplate the administration of combinations, or "cocktails" of
different individual antibodies recognizing epitopes of a plurality of kallikrein polypeptides, and optionally
CA125. Such cocktails may have certain advantages inasmuch as they contain antibodies that bind to
different epitopes and/or exploit different effector mechanisms or combine directly cytotoxic antibodies with
antibodies that rely on immune effector functionality. Such antibodies in combination may exhibit
synergistic therapeutic effects. In addition, the administration of the antibodies may be combined with other
therapeutic agents, including but not limited to chemotherapeutic agents, androgen-blockers, and immune
modulators (e.g., IL2, GM-CSF). The antibodies may be administered in their "naked" or unconjugated form,
or may have therapeutic agents conjugated to them.

The antibodies specific for a plurality of kallikrein polypeptides and optionally CA125, used in the
practice of the method of the invention may be formulated into pharmaceutical compositions comprising a
carrier suitable for the desired delivery method. Suitable carriers include any material which when combined
with the antibodies retains the anti-tumor function of the antibodies and is non-reactive with the subject's
immune systems. Examples include any of a number of standard pharmaceutical carriers such as sterile
phosphate buffered saline solutions, bacteriostatic water, and the like (see, generally, Remington's
Pharmaceutical Sciences 16th Edition, A. Osol, Ed., 1980).

Antibody formulations may be administered via any route capable of delivering the antibodies to
the tumor site. Routes of administration include, but are not limited to, intravenous, intraperitoneal,
intramuscular, intratumor, intradermal, and the like. Preferably, the route of administration is by intravenous
injection. Antibody preparations may be lyophilized and stored as a sterile powder, preferably under
vacuum, and then reconstituted in bacteriostatic water containing, for example, benzyl alcohol preservative,
or in sterile water prior to injection.

Treatment will generally involve the repeated administration of the antibody preparation via an
acceptable route of administration such as intravenous injection (IV), at an effective dose. Dosages will
depend upon various factors generally appreciated by those of skill in the art, including the type of cancer
and the severity, grade, or stage of the cancer, the binding affinity and half life of the antibodies used, the
degree of expression of a plurality of kallikrein polypeptides, and optionally CA125, in the patient, the extent
of circulating kallikrein polypeptide antigens, and optionally CA125 antigens, the desired steady-state
antibody concentration level, frequency of treatment, and the influence of any chemotherapeutic agents used
in combination with a treatment method of the invention.
Daily doses may range from about 0.1 to 100 mg/kg. Doses in the range of 10-500 mg antibodies
per week may be effective and well tolerated, although even higher weekly doses may be appropriate and/or
well tolerated. A determining factor in defining the appropriate dose is the amount of antibodies necessary to
be therapeutically effective in a particular context. Repeated administrations may be required to achieve
tumor inhibition or regression. Direct administration of antibodies specific for a plurality of kallikrein
polypeptides and optionally CA125 is also possible and may have advantages in certain situations.

Patients may be evaluated for a plurality of kallikrein polypeptides and optionally CA125,
preferably in serum, in order to assist in the determination of the most effective dosing regimen and related
factors. The assay methods described herein, or similar assays, may be used for quantitating circulating
kallikrein polypeptide and optionally CA125 levels in patients prior to treatment. Such assays may also be
used for monitoring throughout therapy, and may be useful to gauge therapeutic success in combination with
evaluating other parameters, such as serum kallikrein polypeptides, and optionally CA125, levels.

The invention further provides vaccines formulated to contain a plurality of kallikrein polypeptides,
and optionally CA125, or fragments thereof. The use in anti-cancer therapy of tumor antigens in a vaccine
for generating humoral and cell-mediated immunity is well known and, for example, has been employed in
prostate cancer using human PSMA and rodent PAP immunogens (Hodge et al., 1995, Int. J. Cancer 63: 231-
237; Fong et al., 1997, J. Immunol 159: 3113-3117). These methods can be practiced by employing a
plurality of kallikrein polypeptides, and optionally CA125, or fragments thereof, or nucleic acids and
recombinant vectors capable of expressing and appropriately presenting the kallikrein and optionally CA125
immunogens.

By way of example, viral gene delivery systems may be used to deliver nucleic acids encoding a
plurality of kallikrein polypeptides, and optionally CA125. Various viral gene delivery systems which can be
used in the practice of this aspect of the invention include, but are not limited to, vaccinia, fowlpox,
canarypox, adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus, and Sindbis virus (Restifo,
1996, Curr. Opin. Immunol. 8: 658-663). Non-viral delivery systems may also be employed by using naked
DNA encoding a plurality of kallikrein polypeptides, and optionally CA125, or fragments thereof introduced
into the patient (e.g., intramuscularly) to induce an anti-tumor response.

Various ex vivo strategies may also be employed. One approach involves the use of cells to present
kallikrein and optionally CA125 antigens to a patient's immune system. For example, autologous dendritic
cells which express MHC class I and II, may be pulsed with a plurality of kallikrein polypeptides, and
optionally CA125, or peptides thereof that are capable of binding to MHC molecules, to thereby stimulate
ovarian cancer patients' immune systems (See, for example, Tjoa et al., 1996, Prostate 28: 65-69; Murphy et
al., 1996, Prostate 29: 371-380).

Anti-idiotypic antibodies specific for a plurality of kallikrein polypeptides, and optionally CA125,
can also be used in anti-cancer therapy as a vaccine for inducing an immune response to cells expressing the
polypeptides. The generation of anti-idiotypic antibodies is well known in the art and can readily be adapted
to generate anti-idiotypic antibodies that mimic an epitope on a kallikrein polypeptide or CA125 (see, for
example, Wagner et al., 1997, Hybridoma 16: 33-40; Foon et al., 1995, J Clin Invest 96: 334-342; Herlyn et
al., 1996, Cancer Immunol Immunother 43: 65-76). Such antibodies can be used in anti-idiotypic therapy as
presently practiced with other anti-idiotypic antibodies directed against tumor antigens.

Genetic immunization methods may be utilized to generate prophylactic or therapeutic humoral and
cellular immune responses directed against cancer cells expressing a plurality of kallikrein polypeptides, and
optionally CA125. Constructs comprising DNA encoding kallikrein and optionally CA125
polypeptides/immunogens and appropriate regulatory sequences may be injected directly into muscle or skin
of an individual, such that the cells of the muscle or skin take up the construct and express the encoded
kallikrein or CA125 polypeptides/immunogens. The polypeptides/immunogens may be expressed as cell
surface proteins or be secreted. Expression of the polypeptides/immunogens results in the generation of
prophylactic or therapeutic humoral and cellular immunity against the cancer. Various prophylactic and
therapeutic genetic immunization techniques known in the art may be used.

The invention further provides methods for inhibiting cellular activity (e.g., cell proliferation,
activation, or propagation) of a cell expressing a plurality of kallikrein polypeptides, and optionally CA125.
This method comprises reacting immunoconjugates of the invention (e.g., a heterogeneous or homogenous
mixture) with the cell so that the kallikrein polypeptides, and optionally CA125, form complexes with the
immunoconjugates. A subject with a neoplastic or preneoplastic condition can be treated when the inhibition
of cellular activity results in cell death.

In another aspect, the invention provides methods for selectively inhibiting a cell expressing a 
plurality of kallikrein polypeptides, and optionally CA125, by reacting a combination of immunoconjugates 
of the invention with the cell in an amount sufficient to inhibit the cell. Amounts include those that are 
sufficient to kill the cell or sufficient to inhibit cell growth or proliferation. 

Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial
plasmids, may be used to deliver nucleic acids encoding a plurality of kallikrein polypeptides, and optionally
CA125, to a targeted organ, tissue, or cell population. Methods well known to those skilled in the art may be
used to construct recombinant vectors that will express antisense nucleic acid molecules for kallikrein
polypeptides and CA125. (See, for example, the techniques described in Sambrook et al (supra) and Ausubel
et al (supra)).

Genes encoding a plurality of kallikrein polypeptides, and optionally CA125, can be turned off by
transfecting a cell or tissue with vectors that express high levels of desired kallikrein or CA125
polypeptide-encoding fragments. Such constructs can inundate cells with untranslatable sense or antisense
sequences. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA
molecules until all copies are disabled by endogenous nucleases.

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or
PNA, to the regulatory regions of genes encoding kallikrein polypeptides, and optionally CA125, i.e., the
promoters, enhancers, and introns. Preferably, oligonucleotides are derived from the transcription initiation
site, e.g., between -10 and +10 regions of the leader sequence. The antisense molecules may also be designed
so that they block translation of mRNA by preventing the transcript from binding to ribosomes. Inhibition
may also be achieved using "triple helix" base-pairing methodology. Triple helix pairing compromises the
ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or
regulatory molecules. Therapeutic advances using triplex DNA were reviewed by Gee J E et al (In: Huber B
E and B I Carr (1994) Molecular and Immunologic Approaches, Futura Publishing Co, Mt Kisco N.Y.).

Ribozymes are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes
act by sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by
endonucleolytic cleavage. The invention therefore contemplates engineered hammerhead motif ribozyme
molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding a
plurality of kallikrein polypeptides, and optionally CA125.

Specific ribozyme cleavage sites within any potential RNA target may initially be identified by
scanning the target molecule for ribozyme cleavage sites which include the following sequences: GUA,
GUU and GUC. Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides
corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary
structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may
also be determined by testing accessibility to hybridization with complementary oligonucleotides using
ribonuclease protection assays.
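
The scanning step described above can be illustrated with a short sketch. It is not part of the original disclosure: the function names are hypothetical, the input is assumed to be a plain RNA string, and the secondary-structure evaluation and ribonuclease protection assays are outside its scope.

```python
# Cleavage-site triplets named in the text.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna):
    """Return (position, triplet) pairs for every GUA, GUU or GUC in the RNA."""
    rna = rna.upper().replace("T", "U")  # convenience: accept DNA-style input
    return [(i, rna[i:i + 3]) for i in range(len(rna) - 2)
            if rna[i:i + 3] in CLEAVAGE_TRIPLETS]


def candidate_windows(rna, site, min_len=15, max_len=20):
    """Yield every 15 to 20 nt subsequence that contains the triplet at `site`."""
    rna = rna.upper().replace("T", "U")
    for length in range(min_len, max_len + 1):
        first = max(0, site + 3 - length)      # earliest start that still covers the triplet
        last = min(site, len(rna) - length)    # latest start that fits inside the sequence
        for start in range(first, last + 1):
            yield rna[start:start + length]
```

Each window returned this way would still need to be evaluated for secondary structure before being considered a usable target region.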

Methods for introducing vectors into cells or tissues include those methods discussed herein and
which are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced
into stem cells obtained from a patient and clonally propagated for autologous transplant into the same
patient (See U.S. Pat. Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well
known in the art.

Kallikrein polypeptides, optionally CA125 polypeptide, and/or polynucleotides encoding the
polypeptides, and fragments thereof, antibodies and/or agents identified using a method of the invention, or
combinations thereof, may be used in the treatment of ovarian cancer or diseases, conditions or syndromes
associated with ovarian cancer, in a subject. A combination of kallikrein polypeptides and/or
polynucleotides encoding the kallikreins (e.g. kallikreins 7 and 8) and inhibitors (antisense, antibodies, or
agents) of other kallikreins (e.g. kallikreins 5, 6, 10 and 11) and/or CA125 may be used in a prognostic or
therapeutic method of the invention. The polypeptides, polynucleotides, and agents may be formulated into
compositions for administration to subjects suffering from ovarian cancer. Therefore, the present invention
also relates to a composition comprising a plurality of kallikrein polypeptides and optionally CA125, or
nucleic acids encoding the polypeptides, or a fragment thereof, or an agent identified using a method of the
invention, and a pharmaceutically acceptable carrier, excipient or diluent. A method for treating or
preventing ovarian cancer in a subject is also provided comprising administering to a patient in need thereof,
a plurality of kallikrein polypeptides and optionally CA125, or nucleic acids encoding the polypeptides, an
agent identified in accordance with a method of the invention, and/or a composition of the invention.

The active substance may be administered in a convenient manner such as by injection
(subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal
administration. Depending on the route of administration, the active substance may be coated in a material to
protect the substance from the action of enzymes, acids and other natural conditions that may inactivate the
substance.

The compositions described herein can be prepared by per se known methods for the preparation of
pharmaceutically acceptable compositions which can be administered to subjects, such that an effective
quantity of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle.
Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the
compositions include, albeit not exclusively, solutions of the active substances in association with one or
more pharmaceutically acceptable vehicles or diluents, and contained in buffered solutions with a suitable
pH and iso-osmotic with the physiological fluids.

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment (e.g. chemotherapy or radiotherapy). The compositions of the 
invention may be administered concurrently, separately, or sequentially with other therapeutic agents or 
therapies. 

The following non-limiting examples are illustrative of the present invention:

Example 1 

To investigate the additional discriminatory value of the kallikreins to CA125, a logistic regression
model was developed. Included in the study were serum samples from 39 ovarian cancer patients and 194
non-cancer females. The age of the patients was as follows: median = 59, range = 32-82. The age of the
controls was as follows: median = 46, range = 22-77. The model was adjusted for the following variables:
f(x) = -0.29*hK5 + 0.12*hK6 - 0.65*hK7 - 0.6*hK8 + 1.09*hK10 + 0.98*hK11 + 0.057*CA125 - 0.62. For these
data, the crude odds ratio and the 95% confidence interval were found to be 2.71 and 1.91-3.84 (p<0.001).
The log likelihood scores for this multivariate logistic regression model, which incorporated the combined
variables for each patient, were calculated. From these data, by picking different thresholds for the regression
function values, a ROC curve was devised which shows the added value of using kallikreins and CA125
together in a multivariate function (AUC, 0.99; 95% CI, 0.96-1.00) (see Figure 8). Statistically significant
correlations between age and other studied variables were not observed.
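
As an illustration only, the regression function of Example 1 can be evaluated for one subject as sketched below. The coefficients are read from the printed function. The logistic transform, the dictionary layout and the function names are assumptions of this sketch, since the text does not state the units of the inputs or how f(x) was converted to a probability.

```python
import math

# Coefficients as printed in Example 1; the trailing -0.62 is taken as the intercept.
COEFFS = {"hK5": -0.29, "hK6": 0.12, "hK7": -0.65, "hK8": -0.6,
          "hK10": 1.09, "hK11": 0.98, "CA125": 0.057}
INTERCEPT = -0.62


def regression_score(levels):
    """Linear predictor f(x) for one subject's serum marker levels."""
    return INTERCEPT + sum(COEFFS[name] * levels[name] for name in COEFFS)


def predicted_probability(levels):
    """Logistic transform of f(x); assumes f(x) is on the log-odds scale."""
    return 1.0 / (1.0 + math.exp(-regression_score(levels)))
```

Sweeping a threshold over `regression_score` values for cases and controls is what generates the points of a ROC curve such as the one described above.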
Example 2 

Statistically significant differences in serum kallikrein concentration were found between patient and
control subjects for kallikreins hK5 (p<0.0001), hK7 (p=0.007), hK8 (p=0.005), hK10 (p=0.0003) and
CA125 (p<0.0001) by the Mann-Whitney test. The diagnostic sensitivity (SENS) and specificity (SPEC) for
each one of these markers were as follows (SENS/SPEC; both as %): 31/95 (hK5); 62/71 (hK7); 62/70
(hK10); 54/54 (hK11); 89/94 (CA125). When these data were combined in a logistic regression model,
kallikreins 5 and 10 did not contribute to a great extent to the sensitivity and specificity of CA125. The area
under the curve of CA125 alone (93%) improved by a further 1% when adding hK6, by 2% when adding
hK11, 3% when adding hK7 and 5% when adding hK8. The combination of CA125 and hK8 resulted in an
AUC of 98%.
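
The area under the ROC curve and the Mann-Whitney statistic used in this example are closely related: the AUC equals the U statistic divided by the product of the two group sizes. A minimal sketch of that calculation follows. The function name is hypothetical and no study data are reproduced here.

```python
def auc_from_ranks(case_values, control_values):
    """AUC as the probability that a random case scores above a random control.

    Ties count as one half, so the result equals the Mann-Whitney U statistic
    divided by (number of cases * number of controls).
    """
    wins = 0.0
    for case in case_values:
        for control in control_values:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_values) * len(control_values))
```

For a marker where low values are associated with cancer, the values can be negated before the call so that an AUC above 0.5 still indicates discrimination in the expected direction.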

Below is a summary of each marker and its ability to separate the cases and controls.

hK5: high values associated with cancer
test+ is hK5>0.10, test- is hK5<=0.10
sensitivity=31%, specificity=95%,
AUC=.62, p(AUC)=.02
Wilcoxon rank sum test has p<.0001.
Of the 233 persons analyzed, 207 have value zero for hK5 (27 cases, 180 controls).
Possible good marker

hK6: high values associated with cancer
test+ is hK6>6.3, test- is hK6<=6.3
sensitivity=69%, specificity=40%,
AUC=.50, p(AUC)=1.00
Wilcoxon rank sum test has p=.91.
Not a good marker


hK7: low values associated with cancer
test+ is hK7<2.05, test- is hK7>=2.05
sensitivity=62%, specificity=71%,
AUC=.64, p(AUC)=.006
Wilcoxon rank sum test has p=.007.
Possible good marker

hK8: low values associated with cancer
test+ is hK8<13.0, test- is hK8>=13.0
sensitivity=72%, specificity=42%,
AUC=.64, p(AUC)=.006
Wilcoxon rank sum test has p=.005.
Possible good marker

hK10: high values associated with cancer
test+ is hK10>1.42, test- is hK10<=1.42
sensitivity=62%, specificity=70%,
AUC=.68, p(AUC)=.0004
Wilcoxon rank sum test has p=.0003.
Best single kallikrein marker

hK11: high values associated with cancer
test+ is hK11>0.14, test- is hK11<=0.14
sensitivity=54%, specificity=54%,
AUC=.58, p(AUC)=.12
Wilcoxon rank sum test has p=.11.
Not a good marker

CA125: high values associated with cancer
test+ is Ca125>34, test- is Ca125<=34
sensitivity=89%, specificity=94%,
AUC=.933, p(AUC)=<.0001
Wilcoxon rank sum test has p<.0001.
Good marker
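
Each single-marker summary above applies one cutoff and reports the resulting sensitivity and specificity. A minimal sketch of that calculation follows. The function name and its arguments are hypothetical, and the serum values themselves are not given in this text.

```python
def sensitivity_specificity(case_values, control_values, cutoff, high_is_positive=True):
    """Apply a single-cutoff test and return (sensitivity, specificity)."""
    if high_is_positive:
        def positive(value):
            return value > cutoff   # e.g. test+ is Ca125>34
    else:
        def positive(value):
            return value < cutoff   # e.g. test+ is hK7<2.05
    true_pos = sum(1 for v in case_values if positive(v))
    true_neg = sum(1 for v in control_values if not positive(v))
    return true_pos / len(case_values), true_neg / len(control_values)
```

The `high_is_positive` flag covers both directions seen in the summaries: high values associated with cancer (hK5, hK6, hK10, hK11, CA125) and low values associated with cancer (hK7, hK8).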


After some further multivariate analysis of only the kallikrein markers, the combination of hK 7, 8, 10 and
11 was a preferred set. This combination was arrived at by looking at the incremental AUC as markers were
combined. Below is a summary of all the models tried:

hK10 alone, AUC=.68
hK10+hK7: AUC=.88
hK10+hK7+hK8: AUC=.90
hK10+hK7+hK8+hK11: AUC=.925

Multivariate model of hK7, hK8, hK10, hK11, call it hK7_8_10_11

hK7_8_10_11:
Calculate SA=2.00-1.49(hK7)-.34(hK8)+1.16(hK10)+3.50(hK11)
high values associated with cancer
test+ is SA>-1.15, test- is SA<=-1.15
sensitivity=87%, specificity=89%,
AUC=.93, p(AUC)=<.0001
Wilcoxon rank sum test has p<.0001.
Good marker
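
The combined kallikrein score can be computed as sketched below, assuming the printed formula reads SA = 2.00 - 1.49(hK7) - .34(hK8) + 1.16(hK10) + 3.50(hK11) and that the inputs are serum concentrations in the same units used in the study. The function names are hypothetical.

```python
SA_CUTOFF = -1.15  # test+ is SA > -1.15


def sa_score(hk7, hk8, hk10, hk11):
    """Combined kallikrein score SA from the hK7_8_10_11 model."""
    return 2.00 - 1.49 * hk7 - 0.34 * hk8 + 1.16 * hk10 + 3.50 * hk11


def sa_test_positive(hk7, hk8, hk10, hk11):
    """Return True when the score exceeds the cutoff, i.e. the test is positive."""
    return sa_score(hk7, hk8, hk10, hk11) > SA_CUTOFF
```

The negative coefficients on hK7 and hK8 are consistent with the single-marker summaries, where low values of those two markers are associated with cancer.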

The hK marker that added the most to CA125 was also investigated.

CA125 alone, AUC=.933
CA125+hK8: AUC=.978

Multivariate model of Ca125, hK8, call it Ca125_hK8

Ca125_hK8:
SC=-1.71+.086(Ca125)-.47(hK8)
high values associated with cancer
test+ is SC>-2.52, test- is SC<=-2.52
sensitivity=97%, specificity=90%,
AUC=.978, p(AUC)=<.0001
Wilcoxon rank sum test has p<.0001.
Good marker

Below is a summary of the above analyses:
a) The preferred kallikrein marker alone is hK10, AUC=.68
b) CA125 has an AUC of .933
c) The preferred combination of kallikrein markers increases the AUC up to .925, which is close to the
CA125 AUC of .933
d) Adding a kallikrein marker to CA125 can increase the AUC up to .978



How does CA125 alone compare with the multivariate kallikrein model hK7_8_10_11?
(based on 39 cases and 186 controls evaluated with CA125)

                  Sensitivity   Specificity   Misclassification
CA125             90%           94%           12 FP, 4 FN, total 16 pts misclassified
hK7_8_10_11       85%           89%           31 FP, 4 FN, total 35 pts misclassified
both positive     77%           100%          0 FP, 9 FN, total 9 pts misclassified
either positive   97%           82%           33 FP, 1 FN, total 34 pts misclassified
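
The "both positive" and "either positive" rows correspond to combining the two test calls with a logical AND and a logical OR. A short sketch under that reading follows. The function names are hypothetical and no patient-level data are reproduced.

```python
def combine_tests(ca125_positive, kallikrein_positive):
    """Return the two combined calls for one subject."""
    return {
        "both positive": ca125_positive and kallikrein_positive,
        "either positive": ca125_positive or kallikrein_positive,
    }


def misclassified(calls, has_cancer):
    """Count (false positives, false negatives) for parallel lists of booleans."""
    fp = sum(1 for call, truth in zip(calls, has_cancer) if call and not truth)
    fn = sum(1 for call, truth in zip(calls, has_cancer) if truth and not call)
    return fp, fn
```

Requiring both tests to be positive can only lower the false-positive count and raise the false-negative count relative to either test alone, while accepting either test does the opposite, which matches the direction of the trade-off in the table.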



How does CA125 alone compare with the multivariate model of CA125 plus hK8?
(based on 39 cases and 186 controls evaluated with CA125)

                  Sensitivity   Specificity   Misclassification
CA125             90%           94%           12 FP, 4 FN, total 16 pts misclassified
CA125 + hK8       95%           91%           17 FP, 2 FN, total 19 pts misclassified

Kallikrein markers approach CA125 in terms of AUC and sensitivity, but their specificity is not as
high. Adding hK8 to CA125 improves sensitivity but specificity is lower than CA125 alone.
Summary

a) The best kallikrein marker alone is hK10, area under the ROC curve (AUC) =.68.

b) CA125 has an AUC of .933. Adding a single kallikrein marker to CA125 can get the AUC
up to .978. Adding hK8 to CA125 improves sensitivity but specificity is lower compared
with CA125 alone.

c) The best combination of kallikrein markers gets the AUC up to .925, which is close to the
CA125 AUC of .933. Kallikrein markers approach CA125 in terms of AUC and
sensitivity, but their specificity is lower.



The present invention is not to be limited in scope by the specific embodiments described herein,
since such embodiments are intended as but single illustrations of one aspect of the invention and any
functionally equivalent embodiments are within the scope of this invention. Indeed, various modifications of
the invention in addition to those shown and described herein will become apparent to those skilled in the art
from the foregoing description and accompanying drawings. Such modifications are intended to fall within
the scope of the appended claims.

All publications, patents and patent applications referred to herein are incorporated by reference in
their entirety to the same extent as if each individual publication, patent or patent application was
specifically and individually indicated to be incorporated by reference in its entirety. All publications,
patents and patent applications mentioned herein are incorporated herein by reference for the purpose of
describing and disclosing the domains, cell lines, vectors, methodologies etc. which are reported therein
which might be used in connection with the invention. Nothing herein is to be construed as an admission that
the invention is not entitled to antedate such disclosure by virtue of prior invention.

It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and
"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a
host cell" includes a plurality of such host cells, reference to the "antibody" is a reference to one or more
antibodies and equivalents thereof known to those skilled in the art, and so forth.

Below full citations are set out for the references referred to in the specification.






Table 1

Kallikrein      Kallikrein Nucleic   GenBank Accession No.
Polypeptide     Acid Designation

Kallikrein 5    KLK5                 AAD26429, AF135028, AF168768

Kallikrein 6    KLK6                 AAB66483, AF013988 (CDS 174..881),
                                     AF149289 (CDS join 3567..3606, 4346..4502,
                                     8122..8369, 9791..9927, 11805..11957), U62801
                                     (CDS 246..980)

Kallikrein 7    KLK7                 AAC37551, L33404 (CDS 16..777), AF166330
                                     (CDS join 3237..3309, 3722..3869, 4566..4813,
                                     5129..5265, 7362..7517) (mRNA join (1756..1785,
                                     3179..3309, 3722..3869, 4566..4813, 5129..5265,
                                     7362..8265) /product="stratum corneum
                                     chymotryptic enzyme" /note="alternatively
                                     spliced"; mRNA join (1756..1785, 3179..3309,
                                     3722..3869, 4566..4813, 5129..5265, 7362..7991)
                                     /note="alternatively spliced"; mRNA join
                                     (1821..1864, 3179..3309, 3722..3869, 4566..4813,
                                     5129..5265, 7362..8265) /product="stratum
                                     corneum chymotryptic enzyme"
                                     /note="alternatively spliced"; mRNA join
                                     (1821..1864, 3179..3309, 3722..3869, 4566..4813,
                                     5129..5265, 7362..7991) /note="alternatively
                                     spliced")

Kallikrein 8    KLK8                 BAA28673, AB009849 (CDS 35..817),
                                     AF095743 (CDS join 1035..1104, 1619..1778,
                                     1944..2206, 4304..4437, 5974..6129, mRNA
                                     500..670, 1027..1104, 1619..1778, 1944..2206,
                                     4304..4437, 5974..6174), AB010780 (CDS join
                                     1..39, 418..712, 878..946), AF055982

Kallikrein 10   KLK10                AAC14266, AF055481 (CDS join 614..701,
                                     2455..2635, 3589..3863, 4195..4328, 4793..4945,
                                     mRNA join 48..120, 605..701, 2455..2635,
                                     3589..3863, 4195..4328, 4793..5474),
                                     NM_002776 (CDS 220..1050)

Kallikrein 11   KLK11                BAA33404, AAD47815, AB012917 (CDS
                                     26..874), AF164623 (CDS 4224..4263,
                                     5061..5217, 5545..5810, 6627..6763, 7158..7310)
                                     (mRNA join (2313..2398, 4189..4263,
                                     5061..5217, 5545..5810, 6627..6763, 7158..7622))
Table 2

Descriptive statistics for hK5, hK6, hK7, hK8, hK10 and hK11 serum protein levels in controls and
patients with ovarian cancer

                        Mean     Standard Error   Median   Range         p value*

hK5 (ng/ml)
  Non cancer (N=194)    0.063    0.029            0.00     0.00-4.50
  Cancer (N=39)         0.48     0.18             0.00     0.00-5.70
  % Increase**          661%     -                -                      <0.001

hK6 (ng/ml)
  Non cancer (N=194)    6.96     0.18             6.60     1.60-15.30
  Cancer (N=39)         9.88     2.20             6.60     1.50-70.80
  % Increase**          42%                                              0.91

hK7 (ng/ml)
  Non cancer (N=194)    2.60     0.071            2.67     0.30-6.00
  Cancer (N=39)         2.49     0.41             1.80     0.00-10.80
  % Decrease**          4%                        33%                    0.007

hK8 (ng/ml)
  Non cancer (N=194)    11.74    0.27             11.70    2.40-22.20
  Cancer (N=39)         11.91    1.88             6.90     0.00-46.20
  % Decrease**                                    41%                    0.005

hK10 (ng/ml)
  Non cancer (N=194)    1.16     0.051            1.08     0.00-4.20
  Cancer (N=39)         6.51     2.46             1.59     0.27-90.0
  % Increase**          461%                      40%                    <0.001

hK11 (ng/ml)
  Non cancer (N=194)    0.21     0.018            0.12     0.00-1.30
  Cancer (N=39)         0.79     0.21             0.18     0.00-5.52
  % Increase**          276%                      50%                    0.011

* Calculated by the Mann Whitney test

** Calculated by assuming that value in non-cancerous tissue is 100%
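
The percentage rows in Table 2 take the non-cancer value as 100% and express the cancer value relative to it. The sketch below reproduces that arithmetic for the mean columns; the function name is hypothetical.

```python
def percent_change(non_cancer: float, cancer: float) -> float:
    """Change of the cancer value relative to the non-cancer value,
    with the non-cancer value taken as 100%. Positive values are
    increases, negative values are decreases."""
    return (cancer / non_cancer - 1.0) * 100.0


# Mean serum levels (ng/ml) taken from Table 2.
hk6_change = percent_change(6.96, 9.88)    # about 42% increase
hk7_change = percent_change(2.60, 2.49)    # about 4% decrease
hk10_change = percent_change(1.16, 6.51)   # about 461% increase
hk11_change = percent_change(0.21, 0.79)   # about 276% increase
```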
Table 3

Correlations between the studied variables in 194 non-cancer cases

variable        hK5      hK6      hK7      hK8      hK10     hK11     CA125

hK5     rs      1.000    0.034    -0.053   0.066    0.134    0.150    0.101
        p       .        0.642    0.462    0.359    0.062    0.037    0.172

hK6     rs      0.034    1.000    0.114    0.298    0.191    0.120    -0.160
        p       0.642    .        0.115    0.000    0.008    0.097    0.029

hK7     rs      -0.053   0.114    1.000    0.497    0.321    0.399    0.135
        p       0.462    0.115    .        0.000    0.000    0.000    0.066

hK8     rs      0.066    0.298    0.497    1.000    0.263    0.396    0.048
        p       0.359    0.000    0.000    .        0.000    0.000    0.519

hK10    rs      0.134    0.191    0.321    0.263    1.000    0.176    0.035
        p       0.062    0.008    0.000    0.000    .        0.014    0.638

hK11    rs      0.150    0.120    0.399    0.396    0.176    1.000    0.125
        p       0.037    0.097    0.000    0.000    0.014    .        0.090

CA125   rs      0.101    -0.160   0.135    0.048    0.035    0.125    1.000
        p       0.172    0.029    0.066    0.519    0.638    0.090    .
Table 4

Correlations between the studied variables in 39 ovarian cancer cases

variable        hK5      hK6      hK7      hK8      hK10     hK11     CA125

hK5     rs      1.000    0.475    0.553    0.554    0.618    0.584    0.507
        p       .        0.002    0.000    0.000    0.000    0.000    0.001

hK6     rs      0.475    1.000    0.327    0.513    0.470    0.661    0.530
        p       0.002    .        0.042    0.001    0.003    0.000    0.001

hK7     rs      0.553    0.327    1.000    0.695    0.690    0.748    0.262
        p       0.000    0.042    .        0.000    0.000    0.000    0.107

hK8     rs      0.554    0.513    0.695    1.000    0.602    0.783    0.443
        p       0.000    0.001    0.000    .        0.000    0.000    0.005

hK10    rs      0.618    0.470    0.690    0.602    1.000    0.706    0.548
        p       0.000    0.003    0.000    0.000    .        0.000    0.000

hK11    rs      0.584    0.661    0.748    0.783    0.706    1.000    0.556
        p       0.000    0.000    0.000    0.000    0.000    .        0.000

CA125   rs      0.507    0.530    0.262    0.443    0.548    0.556    1.000
        p       0.001    0.001    0.107    0.005    0.000    0.000    .
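
The coefficients in the correlation tables can be illustrated with a short sketch. It assumes that the coefficient shown is Spearman's rank correlation, computed as the Pearson correlation of the ranks with tied values given their average rank. The function names and the sample inputs are hypothetical, and no p-value is computed.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rank correlation between two marker series."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks enter the calculation, the coefficient is insensitive to the very wide ranges seen for some markers in the cancer group.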
Table 5
We Claim:

1. A method for detecting a plurality of kallikrein markers associated with ovarian cancer in a patient
comprising:

(a) obtaining a sample from a patient;

(b) detecting in the sample a plurality of kallikrein markers and optionally CA125, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(c) comparing the detected amounts with amounts detected for a standard.

2. A method for diagnosing and monitoring ovarian cancer in a subject comprising detecting in a
sample from the subject a plurality of kallikrein markers, wherein the kallikrein markers comprise
or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8,
kallikrein 10, and kallikrein 11.

3. A method as claimed in claim 1 or 2 wherein the plurality of kallikrein markers are detected using
antibodies that bind to each of the plurality of kallikrein markers or parts thereof.

4. A method as claimed in claim 1, 2 or 3 which further comprises detecting CA125.

5. A method of detecting ovarian cancer in a patient, the method comprising comparing:

(a) levels of a plurality of kallikrein markers, and optionally CA125, in a sample from the
patient, wherein the kallikrein markers comprise or are selected from the group consisting
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and

(b) normal levels of expression of the plurality of kallikrein markers, and optionally CA125, in
a control sample, wherein a significant difference in levels of kallikrein markers and
optionally CA125, relative to the corresponding normal levels, is indicative of ovarian
cancer.

6. A method for monitoring the progression of ovarian cancer in a patient, the method comprising: (a)
detecting in a sample from the patient at a first time point, a plurality of kallikrein markers, wherein
the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; (b) repeating step (a) at a subsequent
point in time; and (c) comparing levels detected in steps (a) and (b), and thereby monitoring the
progression of ovarian cancer.

7. A method for determining in a patient whether ovarian cancer has metastasized or is likely to
metastasize in the future, the method comprising comparing (a) levels of a plurality of kallikrein
markers, and optionally CA125, in a patient sample, wherein the kallikrein markers comprise or are
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein
10, and kallikrein 11; and (b) normal levels or non-metastatic levels of the kallikrein markers and
optionally CA125, in a control sample wherein a significant difference between the levels of
expression in the patient sample and the normal levels or non-metastatic levels is an indication that
the ovarian cancer has metastasized.

8. A method for assessing the aggressiveness or indolence of ovarian cancer comprising comparing:
(a) levels of expression of a plurality of kallikrein markers, and optionally CA125, in a patient
sample, wherein the kallikrein markers comprise or are selected from the group consisting of
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (b) normal
levels of expression of the plurality of markers and optionally CA125, in a control sample, wherein
a significant difference between the levels in the patient sample and normal levels is an indication
that the cancer is aggressive or indolent.

9. A method for diagnosing and monitoring ovarian cancer in a sample from a subject comprising
isolating nucleic acids from the sample; and detecting in the sample polynucleotides encoding a
plurality of kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or
are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8,
kallikrein 10, and kallikrein 11.

10. A method as claimed in claim 9 wherein significant differences in the levels of the polynucleotides
in the sample compared to a control is indicative of disease, disease stage, and/or prognosis.

11. A method for determining the presence or absence of ovarian cancer in a subject comprising: (a)
contacting a sample obtained from the subject with oligonucleotides that hybridize to
polynucleotides encoding kallikrein markers, and optionally CA125, wherein the kallikrein markers
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7,
kallikrein 8, kallikrein 10, and kallikrein 11; and (b) detecting in the sample a level of nucleic acids
in the sample that hybridize to the polynucleotides relative to a predetermined cut-off value, and
therefrom determining the presence or absence of ovarian cancer in the subject.

12. A method as claimed in claim 11, wherein the nucleic acids are mRNA and the levels of nucleic
acids are detected by polymerase chain reaction.

13. A method as claimed in claim 11 wherein the nucleic acids are mRNA and the amounts of mRNA
are detected using a hybridization technique, employing oligonucleotide probes that hybridize to
kallikrein markers, and optionally CA125.

14. A method for assessing the potential efficacy of a test agent for inhibiting ovarian cancer in a
patient, the method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally
CA125, and/or polynucleotides encoding same, in a first sample obtained from a patient and
exposed to the test agent, wherein the kallikrein markers comprise or are selected from the group
consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and
(b) levels of the plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding
same, in a second sample obtained from the patient, wherein the sample is not exposed to the test
agent, wherein a significant difference in the levels of expression of the plurality of kallikrein
markers, optionally CA125, and/or polynucleotides encoding same, in the first sample, relative to
the second sample, is an indication that the test agent is potentially efficacious for inhibiting ovarian
cancer in the patient.

15. A method of claim 14 wherein the first and second samples are portions of a single sample obtained
from the patient.

16. A method of claim 14 wherein the first and second samples are portions of pooled samples obtained
from the patient.

17. A method of assessing the efficacy of a therapy for inhibiting ovarian cancer in a patient, the
method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally CA125,
and/or polynucleotides encoding same, in a first sample obtained from the patient, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6,
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and (b) levels of the kallikrein markers,
optionally CA125, and/or polynucleotides encoding same, in a second sample obtained from the
patient following therapy, wherein a significant difference in the levels of expression of the
kallikrein markers, optionally CA125, and/or polynucleotides encoding same, in the second sample,
relative to the first sample, is an indication that the therapy is efficacious for inhibiting ovarian
cancer in the patient.

18. A method of selecting an agent for inhibiting ovarian cancer in a patient, the method comprising (a)
obtaining a sample comprising cancer cells from the patient; (b) separately exposing aliquots of the
sample in the presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein
markers, optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein
the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) selecting one of the test agents
which alters the levels of kallikrein markers, optionally CA125, and/or polynucleotides encoding
same, in the aliquot containing that test agent, relative to other test agents.

19. A method of inhibiting ovarian cancer in a patient, the method comprising (a) obtaining a sample
comprising cancer cells from the patient; (b) separately maintaining aliquots of the sample in the
presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein markers,
optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6,
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) administering to the patient at
least one of the test agents which alters the levels of kallikrein markers, optionally CA125, and/or
polynucleotides encoding same, in the aliquot containing that test agent, relative to other test agents.

20. A method of assessing the ovarian cell carcinogenic potential of a test compound, the method
comprising: (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test
compound; and (b) comparing expression of a plurality of markers, optionally CA125, and/or
polynucleotides encoding same, in each of the aliquots, wherein the markers comprise or are
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein
10, and kallikrein 11, and wherein a significant difference in levels of kallikrein markers, optionally
CA125, and/or polynucleotides encoding same, in the aliquot maintained in the presence of the test
compound, relative to the aliquot maintained in the absence of the test compound, is an indication
that the test compound possesses ovarian cell carcinogenic potential.

21. A method of inhibiting ovarian cancer in a patient at risk for developing ovarian cancer, the method
comprising inhibiting expression of genes encoding kallikrein markers and optionally CA125,
wherein the kallikrein markers comprise or are selected from the group consisting of kallikrein 5,
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11.

22. A method of any preceding claim wherein the plurality comprises at least three of the markers.

23. A method of any preceding claim wherein the plurality comprises at least five of the markers.

24. A method of any preceding claim wherein the plurality of kallikrein markers is selected from the
group consisting of kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and
kallikrein 10; kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and
kallikrein 10; kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; kallikrein 5, kallikrein 7,
kallikrein 8, kallikrein 10, and kallikrein 11; or kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8,
kallikrein 10 and kallikrein 11.

25. A method of any preceding claim wherein the kallikrein markers are kallikrein 7, kallikrein 8,
kallikrein 10 and kallikrein 11.

26. A method of any preceding claim wherein the patient sample comprises serum obtained from the
patient.

27. A kit for carrying out a method as claimed in any preceding claim.

28. A kit for assessing whether a patient is afflicted with ovarian cancer, the kit comprising reagents
that specifically bind with a plurality of kallikrein markers and optionally CA125, wherein the
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6,
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11.

29. A kit for assessing the suitability of each of a plurality of agents for inhibiting ovarian cancer in a
patient, the kit comprising: (a) the plurality of agents; and (b) reagents for detecting a plurality of
kallikrein markers and optionally CA125, wherein the kallikrein markers comprise or are selected
from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and
kallikrein 11.

30. A kit as claimed in claim 28 or 29 wherein the reagents are antibodies that specifically bind with
protein or protein fragments corresponding to kallikrein markers and optionally CA125.
Figure 2
Figure 3
Figure 4
Figure 5
Figure 7
Sequence Listing



SEQ ID NO. 1 

CA125 amino acid 

5 1 mlkpBglpgs ssptrslmtg srstkatpem dsgltgatls pktstgaiw tehtlpf tsp 

61 dktlasptss wgrttqslg vrassalpest srgmthseqr tspslspqvn gtpsmypat 
121 BitrvsglsBpr trtsstegnf tkeastytlt vettsgpvte kytvptetst tegdstetpw 
181 dtryipvkit spmktfadst askenapvsm tpaettvtda htpgrtnpsf gtlyssfldl 
241 spkgtpnsrg etslelilst tgypfsspep gsaghsriat saplsssasv Idnkisetsi 
10 301 fsgqsltspl spgvpearas tmpnsaipfs mtlsnaetsa ervrstissl gtpsistkqt 

361 aetiltfhaf aetradipsth iaktlasewl gepgtlggts tsaltttsps ttlvseetnt 
421 hhstsgkete gtlntsmtpl etsapgeese mtatlvptlg fttldskirs psqvssshpt 
481 relrttgsts grqssstaah gssdilratt sstskasswt sestaqqf se pqhtqvrvete 
541 psmkterppa stsvaapitt svpswsgft tlktsstkgi wleetsadtl igestagptt 
15 601 hqfavptgis mtggsstrgs qgtthlltra tassetsadl tlatngvpvs vspavsktaa 

661 gssppggtkp sytmvssvip etsslqssaf regtalgltp Intrhpfssp epdsaghtki 
721 stsipllssa svledkvsat stfshiikats sittgtpeis tktkpssavl ssratlsnaat 
781 spervmats plthpspsge etagsvltls tsaettdspn ihptgtltse ssespstlsl 
841 psvsgvkttf ssstpsthlf tsgeeteets npsvsqpets vsrvrttlas tsvptpvfpt 
20 901 radtwptrsaq fssBhlvsel ratsstsvtn stgsalpkis hltgtatmsq tixrdtfndsa 

961 apqsttwpet sprfktglps atttvstsat siBatvmvsk ftspatssme atsirepstt 
1021 ilttettngp gsmavastni pigkgyiteg rldtshlpig ttassetsmd ftmakeavsra 
1081 svspsqsrada agsstpgrts qfvdtfsddv yhltsreiti prdgtssalt pqratathpps 
1141 pdpgsarstw Igilssspss ptpkvtmsst fBtqrvttsm imdtvetsrw nnpnlpstts 
25 1201 Itpsniptsg aigkstlvpl dtpspatsle asegglptls typeatntps ihlgahasse 

12 61 spstikltma swkpgsytp Itfpsiethi hvstarraays sgsspemtap getntgstwd 
1321 pttyitttdp kdtssaqvst phsvrtlrtt enhpkteBat paaysgspki BsspnltBpa 
1381 tkawtitdtt ehstqlhytk laekssgfet qsapgpvsw iptsptigss tleltsdvpg 
1441 eplvlapseq ttitlpmatw Istslteema stdldissps sptnstfaifp pitistpshels 
30 1501 kaeadtsair ntdsttldqh Igirslgrtg dlttvpitpl tttwtsvieh. stqaqdtlsa 

1561 tmspthvtqB Ikdqtsipas aBpshltevy pelgtqgrss seattfwkps tdtlareiet 
1621 gptniqstpp mdntttgssB sgvtlgiahl pigtsspaet BtnmalerrB Btatvsmagt 
1681 nigllvtsapg rsiBqslgrv sBvlsestte gvtdsakgBS prlntqgnta IssslepBya 
1741 egsqmstBip Itsspttpdv eflggatfwt kevttvmtsd iskss£u:tes Bsatlmstal 
35 1801 gstentgkek Irtasmdlps ptpsmevtpw isltlsnapn ttdaldlshg vhtssagtla 

1861 tdrslntgvt rasrlengsd tsskslsmgn sthtsmtdte ksevssBihp rpetaapgae 
1921 ttltstpgnr aisltlpfBS ipveevistg itsgpdinsa pmthspitpp tivwtstgti 
1981 eqstqplhav ssekvsvqtq stpyvnsvav saspthensv ssgsstsspy ssasleslda 
2041 tisrmaitB wlwdlttBlp tttwpstsls ealssghsgv snpsstttef plfsaastsa 
40 2101 akqmpetet hgpqntaast Intdassvtg Isetpvgasi ssevplpmai tsrsdvsglt 

2161 sestanpslg tassagtklt rtislptses Ivafminkdp wtvsiplgsh pttnbetsip 
2221 vnsagppgls tvasdvidtp sdgaesiptv sfspspdtev ttishfpekt thafrtissl 
2281 theltsrvtp ipgdwmssam atkptgasps itlgerrtit saapttspiv Itasftetst 
2341 vsldnettvk tsdildarkt nelpsdssss sdlintsias Btmdvtktas isptsisgmt 
45 2401 asBBpslfss drpqvptstt etntatspav ssntysldgg snvggtpatl ppftithpve 

2461 tasallawsr pyrtfatnivs tdtasgenpt ssnswtsvp apgtwasvgs ttdlparagfl 
2521 ktspageahs llastiepat aftphlsaav vtgasatsea allttseska ihBspqtptt 
2581 ptsganwets atpesllwt etadttltak ilvtdtilfs tvstppakfp stgtlsgasf 
2641 ptllpdtpai pltateptss latsfdBtpl vtiasdslgt vpettltmse tsngdalvlk 
50 2701 tvsnpdrsip gitiqgvtes plhpsstsps kivapmtty egaitvalst Ipagttgslv 

2761 fsqasenset talvdssagl eraavmpltt gsqgmassgg iragsthstg tktfsslplt 
2821 mnpgevtams eittnrltat qstapkgipv kptsaesgll tpvsasssps kafaslttap 
2881 pstwgipqst Itfefsevps Idtksaslpt pgqalntipd sdastasBsl skspeknpra 
2941 rmmtstkais assfgstgft etpegsasps magheprvpt sgtgdpryas esmsypdpsk 
55 3001 aBsaratstsl asklttlfst gqaarsgsss spialsteke taflsptast srktslflgp 

3061 smarqpnilv hlqtsaltls ptstlnrasqe eppeltssqt iaeeegttae tqtltftpse 
3121 tptsllpvBS pteptarrks spetwassis vpaktslvet tdgtlvttik mssqaaqgns 
3181 twpapaeetg tspagtspgs pevsttlkim sskepslBpe irBtvrnapw ktpettvpme 
3241 ttvepvtlqa talgsgatsi shlptgttap tksptenrala tervslspsp peawtnlysg 
3301 tpggtrqsla tmssvslesp tarsitgtgq qsspelvskt tgraefsmwhg stggttgdth
3361 vslstBsnil edpvtspnsv ssltdkskhk tetwvsttai pstvlnnkim aaeqqtsrsv 
3421 deaysstssw sdqtsgsdit Igaspdvtnt lyitstaqtt slvslpsgdq gitsltnpsg 
3481 gktssassvt spsigletlr anvsavksdi aptaghlsqt sspaevsild vttaptpgis 
5 3541 ttittmgtnB istttpnpev gmstmdstpa terrttsteh pstwsstaas dswtvtdxnts 

3601 nlkvarspgt Istmhttsfl assteldBms tphgritvig tslvtpssda savktetets 
3661 ertlspsdtt astpistfsr vqrmsisvpd ilstswtpss teaedvpvsm vstdhastkt 
3721 dpntplstfl fdslstldwd tgrslssata ttsapggatt pqeltletmi spatsqlpfs 
3781 ighitsavtp aamarssgvt fsrpdptskk aeqtstqlpt ttsahpgqvp rsaattldvl 

10 3B41 phtsOctpdat fqrqgqtalt tearatsdsw nekekstpsa pwitemmnsv aedtikevts 

3901 sssvlkdpey aghklgiwdd fipkfgkaah mrelpllspp gdkealhpst ntvettgwvt 
3961 ssehashstl pahsassklt spwttstre qaivsmsttt wpestraxte pnsfltielr 
4021 dvspymdtss ttqtsiissp gstsdtkgpr teitsskrls sBflagsmrs sdspseaitr 
40B1 Isnfpamtes ggmilatnqtB ppgatslsap tldtsatasw tgtplattqr ftysekttlf 

15 4141 Bkgpedtsqp sppsveetss asslvpihat tspsnillts gghspsstpp vtsvflsets 

4201 glgkttdmsr i&lepgtslp pnlsBtagea Istyeasrdt kalhhsadta vtnmeatsse 
4261 yspipghtkp skatsplvts himgditsst svfgssette ietvssvnqg Iqerstsqva 
4321 Bsatetstvi thvssgdatt hvtktqatfs sgtsissphq fltstntftd vstnpBtsll 
4381 mtessgvtit tqtgptgaat qgpylldtst nrpyltetpla vtpdfmqsek ttliskgpkd 

20 4441 vtwtsppsva etsypssltp flvttippat stlqgqhtss pvsatsvltB glvkttdmln 

4501 tsmepvtzisp qQlnnpsnel latlaattdi etihpsinka vtnmgtassa hvlhstlpvs 
4561 sepstatspm vpasstngdal asisipgset tdiegeptss Itagrkenst Iqemnsttes 
4621 niilsnvsvg aiteatkmev psfdatfipt paqstkfpdi f svassrlsn sppmtisthm 
4681 tttqtgssga tskiplaldt stletsagtp swtegfahs kittamnndv kdvsqtnppf 

25 4741 qdeasspssq apvlvttlps svaftpqwhs tSBpvsmBBv ItSBlvktag kvdtsletvt 

4801 sspqamsntl ddisvtsaat tdietthpsi ntwtnvgtt gsafeshstv saypepskvt 
4861 spnvttstme dttisrsipk sskttrtete ttssltpklr etsisqeita stetstvpyk 
4921 eltgattevs rtdvtBSSSt sfpgpdqstv sldistetnt rlstspirate saeitittqt 
4981 gphgatBqdt ftradpsnttp qagihsamth gfsqldvttl msripqdvsw tsppsvdkts 

30 5041 spesflsspa mttpslisst Ipedklsspm tslltsglvk itdilrtrle pvtsslpnfs 

5101 stsdkilats kdskdtkeif painteetnv kannsghesh spaladsetp kattqrwitt 
5161 tvgdpapsts mpvhgssett nikreptyfl tprlretsts qessfptdts fllskvptgt 
5221 itevsBtgvn ssskistpdh dkstvppdtf tgeiprvfts siktksaerat ittqasppes 
5281 ashstlpldt sttlsqggth stvtqgfpys evttlmgmgp gnvswmttpp veetssvssl 

35 5341 msBparatsps pvsstspqsi pssplpvtal ptsvlvtttd vlgttspesv tssppnlssi 

5401 therpatykd tahteaamhh stntavtnvg tsgsghksqs svladsetsk atplmsttst 
5461 Igdtsvstst pnisqtnqiq teptaslspr Iresstsekt ssttetntaf syvptgaitq 
5521 asrteisssr tsisdldrpt iapdistgmi trlftspimt ksaetntvttq tttpgatsqg 
5581 ilpwdtsttl fqggthBtvs qgfphseitt Irsrtpgdvs wmttppveet BBgfslmsps 

40 5641 ratspspvsst spesipsapl pvtalltsvl vtttnvlgtt spetvtsspp nlssptqerl 

5701 ttykdtahte amhasmhtnt avanvgtsis ghesqssvpa dshtskatsp mgitfamgdt 
5761 BVBtstpa£f etrlqtests slipglrdtr tseelntvte tstvlsevpt ttttevsrte 
5821 vitssrttis gpdhskmspy Istetitrls tfpfvtgste maitnqtgpl gtlsqatltl 
5881 dtSBtasweg thspvtqrfp haeetttmsr stkgvswqsp psveetssps spvplpaits 

45 5941 hsBlysavsg ssptsalpvt Blltsgrrkt Idmldthsel vtsslpsass fsgelltsea 

6001 stntetihfs entaetnmgt tnsmhklhss vslhsqpsgh tppkvtgsnnn edaivststp 
6061 gspetknvdr dstspltpel kedstalvmn Bttesntvfs svaldaatev sraevtyydp 
6121 tfmpasaqst kspdispeas sshsnspplt isthktlatq tgpsgvtslg qltldtstia 
6181 tsagtpsart qdfvdsetts vmxmdlndvl ktspfsaeea nslssqapll vttspspvts 

50 6241 tlqehstssl vsvtsvptpt leUcitdindtn lepvtrspqn Irntlatsea ttdthtmhps 

6301 intamanvgt tsspnefyft vspdsdpyka tsawitsts gdsivstsnrp rssamkkies 
6361 ettfsllfrl retstsqkig essdtstvfd kaftaattev Brteltsssr tsiggtekpt 
6421 mspdtstrsv tmlstfaglt kseertiatq tgphratsqg tltwdtsitt sqagthsamt 
6481 hgfsqldlst Itsrvpeyls gtappsvekt ssasBllelp aitspspvpt tlpesrpssp 

55 6541 vhltslptsg Ivkttdralas vaslppnlgs tshkipttse dikdtekmyp stxiiavtnvg 

6601 tttsekesya svpayseppk vtBptnvtsfxi Irdtivstsm pgsseltrie mestfavahg 
6661 Ikgtstsqdp ivsteksavl hklttgatet srtevassrr tsipgpdhst espdistevi 
6721 pslpislgit essnmtiitr tgpplgstsq gtftldtptt ssragthsma tqefphserat 
6781 tvmnkdpeil swtippsiek tsfssalmps pamtsppvss tlpktihttp apmtslltps 

60 6841 Ivmttdtlgt apepttsspp nlsstshvil ttdedttaie arahpststaa tnvettcsgh 

6901 gaqasvltds ektkatapmd ttstmghttv stsrasvsset tkikrestys Itpglretsi 
6961 sqnasfstdt sivlsevptg ttaevsrtev tssgrtsipg psqstvlpei strttntrlfa
7021 sptmtesaera tiptqtgpsg steqdtltld tsttksqakt hstltqrfph semttlmsrg 
70B1 pgdmswqssp slenpsslps llslpattsp ppisstlpvt isssplpvts lltsspvttt 
7141 dralhtspelv tssppklsht sderlttgkd ttnteavhps tntaasnvei psfghespss 
5 7201 aladsetska tspmfitstq edttvaistp hfletsriqk esisslspkl retgssvets 

7261 saietsavls evsigattei srtevtsssr tsisgsaest mlpeisttrk iikfptspil 
7321 aessemtikt qteppgstse stftldtstt pslvithstm tqrlphseit tlvsrgagdv 
7381 prpsslpvee tsppssqlsl samispspvs stlpasshss sasvtspltp gqvkttevld 
7441 asaepetssp pslsstsvei latsevttdt ekihpfpnta vtkvgtsssg hespssvlpd 

10 7501 settkatsam gtislmgdts vstltpalsn trkiqsepas slttrlrets tseet slate 

7561 antvlBkvst gattevsrte aisfsrtsms gpeqstmsqd islgtipris assvltesak 
7621 mtittqtgps estlestlnl ntattpswve thsivlqgfp hpemttsmgr gpggvswpsp 
7681 pfvketspps splslpavts ph.pvsttfla hippsplpvt slltsgpatt tdilgtstep 
7741 gtssBBslst tsherlttyk dtahteavhp stntggtnva ttssgyksge svladsspmc 

15 7801 ttetmgdtBv Itstpaflet rriqtelaas Itpglreasg segtssgtkm stvlskvptg 

7861 atteiskedv tsipgpaqst ispdistrtv swfstepvint esaeitmnth tsplgattqg 
7921 tstlatBBtt sltmthstis qgfshsqmst Imrrgpedvs vmisppllekt rpsfslmssp 
7981 attspspvss tlpeaisssp Ipvtslltsg lakttdmlhk ssepvtnspa nlsstsveil 
8041 atsevttdte kthpssnrtv tdvgtsssgh estsfvlada qtakvtsprav itstraedtav 

20 8101 ststpgffet ariqteptss Itlglrktsa aegtslatem stvlsgvptg ataevartev 

8161 tssertslag faqltvspet stetitrlpt ssiratesaem raiktqtdppg stpesthtvd 
8221 isttpnwvet hstvtqrfsh semttlvsrs pgdmlwpsqs sveetssass llslpattsp 
8281 spvsBtlved fpsaslpvts lltpglvitt drmgisrepg tsstsnlsst sherlttled 
8341 tvdtedmqps thtavtnvrt sisghesqss vlsdsetpka tspmgttytm getsvsiets 

25 8401 dffetsriqi eptssltsgl retssseris sategstvls evpsgattev srtevisarg 

8461 tsmsgpdqft ispdisteai trlstspimt esaesaitie tgspgatseg tltldtsttt 
8521 fwsgthstas pgfshsemtt linsrtpgdvp wpslpsveea ssvssslasp amtstsffsa 
8581 Ipeaisssph pvtalltlgp vkttdmlrts sepetssppn Isstsaeila tsevtkdrek 
8641 ihpasntpw nvgtviykhl spsevladlv ttkptspmat tstlgntsvs tatpafpetm 

30 8701 mtqptsalts glreistsqe tssatersae Isgmptgatt kvsrtealsl grtstpgpaq 

8761 stispeiste titristplt ttgaaemtit pktghsgass qgtftldtas raswpgthsa 
8821 athrsphsgm ttpmsrgped vswpsrpsve ktsppsslvs Isavtspspl ystpsessha 
8881 splrvtslft pvmmkttdml dtslepvtta ppsronitsde slatskatrae teaiqlsent 
8941 avtqragtisa rqefyasypg Ipepakvtsp wtsstikdi vsttipasse itriemests 

35 9001 tltptprets tsqeihsatk pstvpykalt satiedsmtq vmsssrgpsp dqstmsqdis 

9061 sevitrlsts pikaestemt ittqtgspga tsrgtltldt sttfrnagths tasqgfshsq 
9121 ratalmsrtpg dvpwlshpsv eeassasfsl sspvratsssp vsstlpdsih asslpvtsll 
9181 tsglvkttel Igtssepeta sppnlsstsa eilattevtt dteklemtnv vtsgytheap 
9241 ssvladavtt katssmgity ptgdtnvlta tpafsdtsri qtkaklaltp glmetaiaee 

40 9301 tssatekatv Isavptgatt evsrteaiss srtsipgpaq strasadtame titristplt 

9361 rkestdmait pktgpsgats qgtftldsss taswpgthsa ttqrfpqsw ttpmargped 
9421 vawpsplsve knsppsslvs sssvtspapl ystpsgashs spvpvtslft aiimnkatdml 
9481 daalepetta apnmnitsde slatskatte teaihvfent aashvettsa teelyssspg 
9541 faeptkvisp wtsssirdn ntvsttmpgss gitrieiesm ssltpglret rtsqditsst 

45 9601 etstvlykms sgatpevsrt evmpssrtsi pgpaqstmsl disdewtrl etspimtesa 

9661 eitittqtgy alatsqvtlp Igtamtflsg thstmsqgls haemtnlmsr gpeslswtsp 
9721 rfvettrsBB sltslpltts Ispvsstlld aapsaplpvt slilpglvkt tevldtsaep 
9781 ktssspnlBB tsveipatse Imtdtekihp ssntavakvr tsssvheshs svladsetti 
9841 tipsmgitBa vddttvftsn pafsetrrip teptfsltpg fretstseet tsitetsavl 

9901 ygvptaatte vsmteimssn rthipdsdqs tmspdiitev itrlsBssmm aestqmtitt

9961 qksspgataq stltlattta plarthstvp prflhsemtt Imsrspenps wksspfvekt 
10021 sssssllslp vttspsvsst Ipqsipsssf Bvtslltpgm vkttdtstep gtslspnlag 
10081 tsveilaase vttdtekihp ssstnavtnvg ttssghelya svsihaepak atypvgtpss 
10141 roaetsistsm panfettgfe aepfshltag frktnrasldt ssvtptntps spgsthllqs 

10201 sktdftssak tsepdwppaa qyteipvdii tpfnaspaxt estgitsfpe srftmsvtes

10261 thhlstdllp saetistgtv mpslaeamts fattgvprai sgsgspfart esgpgdatls 
10321 tiaeslpsst pvpfsaatft ttdsetipal heitsssatp yrvdtslgte ssttegrlvm 
10381 vstldtssqp grtsstpild trmtesvelg tvtsayqvps iBtrltrtdg imehitkipn 
10441 eaahrgtirp vkgpqtstsp aspkglhtgg tkrmetttta ikttttalkt tsratlttsv 

10501 ytptlgtltp lnasrqmast iltemmittp yvfpdvpett sslatslgae tstalprttp

10561 svlnresett aslvsrsgae rspviqtldv sssepdttas wvihpaetip tvskttpnff
10621 hseldtvsst atshgadvss aiptnispse ldaltplvti sgtdtsttfp tltksphete
10681 trttwlthpa etsstiprti pnfshhesda tpsiatspga etssaipimt vspgaedlvt
10741 sqvtssgtdr nmtiptltls pgepktiasl vthpeaqtss aiptstispa vsrlvtsntvt
10801 slaaktsttn raltnspgep attvslvthp aqtsptvpwt tsiffhsksd ttpsmttshg
10861 aesssavptp tvstevpgw tplvtssrav isttipiltl spgepettps matshgeeas
10921 saiptptvsp gvpgwtslv tssravtstt ipiltfslge pettpsmats hgteagsavp
10981 tvlpevpgmv tslvassrav tsttlptltl spgepettps matshgaeas stvptvspev
11041 pgwtslvts Bsgvnstsip tlilspgele ttpsmatshg aeassavptp tvspgvsgvv
11101 tplvtssrav tsttipiltl sssepettps matshgveas savltvspev pgmvtslvta
11161 sravtsttip tltissdepe tttslvthse akmisaiptl avsptvqglv tslvtsagse
11221 tsafsnltva ssqpetidsw vahpgteass wptltvstg epftnislvt hpaeasatlp
11281 rttsrfshse ldtn^stvts peaesssais ttispgipgv ltslvtssgr disatfptvp
11341 espheseata swvthpavtB ttvprttpny shsepdttps iatspgaeat sdfptitvsp
11401 dvpdmvt8(jv tssgtdtslt iptltlssge petttsfity sethtssaip tlpvspgask
11461 mltslvissg tdstttfptl tetpyepett aiqlihpaet ntiDvpkttpk fshsksdttl
11521 pvaitspgpe assavsttti spdmsdlvts lvpssgtdts ttfptlsetp yepettvtwl
11581 thpaetsttv sgtipnfshr gsdtapsmvt spgvdtrsgv ptttippsip gwtsqvtSB
11641 atdtstaipt ltpspgepet tassathpgt qtgftvpirt vpssepdtma swvthppqts
11701 tpvsrttssf shsapdatpv matsprteas savlttispg apemvtsqit ssgaatsttv
11761 ptlthspgnqp ettallsthp rtgtsktfpa stvfpqvset tasltirpga etstalptqt
11821 tsslftllvt gtsrvdlspt aspgvsakta plsthpgtet stmiptstls lgllettgll
11881 atsssaetst stltltvspa vsglssasit tdkpqtvtsw ntetspsvts vgppefsrtv
11941 tgttmtlips eraptppktsh gegvspttil rttmveatnl attgssptva kttttfntla
12001 gslftplttp gmstlasesv tsrtsynhrs wisttssynr rywtpatstp vtstfspgis
12061 tBsipsstaa tvpfmvpftl nftitnlqye edmrhpgsrk fnaterelqg llkplfrnss
12121 leylysgcrl aslirpekdss amavdaicth rpdpedlgld rerlywelsn ltngiqelgp
12181 ytldrnslyv ngfthrssnp ttstpgtstv dvgtsgtpss spsptaagpl ltnpftlnfti
12241 tnlqyeedmr rtgsrkfntm esvlqgllkp lfkntsvgpl ysgcrltllr pekdgaatgv
12301 daicthrldp kspglnreql ywelskltnd ieelgpytld rnslyvngft hqssvattst
12361 pgtstvdlrt sgtpsslssp timaagpllv pftlnftitn lqygedmghp gsrkfntter
12421 vlqgllgpif kntsvgplys gcrltslrse kdgaatgvda icihhldpks pglnrerlyw
12481 elsqltagik elgpytldm slyvngfthr tsvpttstpg tstvdlgtsg tpfslpspat
12541 agpllvlftl nftitnlkye edrahrpgsrk fnttervlqt llgpmfknts vgllysgcrl
12601 tllrsekdga atgvdaicth rldpkspgld reqlywelsq ltngikelgp ytldrnslyv
12661 ngfthwipvp tsstpgtstv dlgsgtpssl psptaagpll vpftlnftit nlqyeedrahh
12721 pgsrkfntte rvlqgllgpm fkntsvglly sgcrltllrs ekdgaatgvd aicthrldpk
12781 spgvdreqly weleqltngi kelgpytldr nslyvngfth qtsapntatp gtstvdlgts
12841 gtpsslpspt sagpllvpft lnftitnlqy eedmrhpgsr kfnttervlq gllkplfkst
12901 svgplysgcr ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg
12961 pytldrnsly vngfthqtsa pntstpgtst vdlgtsgtps slpsptsagp llvpftlnft
13021 itnlqyeedm hhpgsrkfnt tervlqgllg pmfkntsvgl lysgcrltll rpekngaatg
13081 mdaicshrld pkspglnreq lywelsqlth gikelgpytl dmslyvngf thrssvapts
13141 tpgtstvdlg tsgtpsslps pttavpllvp ftlnftitnl qygedrorhpg srkfntterv
13201 lqgllgplfk nssvgplysg crlislrsek dgaatgvdai cthhlnpqsp gldreqlywq
13261 lsqmtngike lgpytldrns lyvngfthrs sglttstpwt stvdlgtsgt pspvpsptta
13321 gpllvpftln ftltnlqyee dmhrpgsrkf nttervlqgl lspifknssv gplysgcrlt
13381 slrpekdgaa tgmdavclyh pnpkrpgldr eqlywelsql thnitelgpy sldrdslyvn
13441 gfthqnsvpt tstpgtstvy wattgtpssf pghtepgpll ipftfnftlt nlhyeenmqh
13501 pgsrkfntte rvlqgllkpl fkntsvgply sgcrltslrp ekdgaatgmd avclyhpnpk
13561 rpgldreqly welsqlthnl telgpysldr dslyvngfth qnsvpttstp gtstvywatt
13621 gtpasfpght epgpllipft fnftitnlhy eenmqhpgsr kfnttervlq gllkplflent
13681 svgplysgcr ltllzpekhe aatgvdtict hrvdpigpgl drerlywels qltnsitelg
13741 pytldrdsly vngfnprsBV pttstpgtst vhlatsgtps slpghtapvp llipftlnft
13801 itnlhyeenm qhpgsrkfnt tervlqgllk plfkntsvgp lysgcrltll rpekheaatg
13861 vdticthrvd plgpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts
13921 tpgtstvxxg tsgtpss^qsx xtsagpllvp ftlnftitnl C[yeedinhhpg srkfntterv
13981 lggllgpmfk ntsvgllysg crltllrpek ngaatgmdai Gshrldpksp gldreqlywe
14041 lsqlthgike lgpytldrns lyvngfthrs svaptstpgt stvdlgtsgt psslpsptta
14101 vpllvpftln ftitnlqyge dmrhpgarkf nttervlqgl lgplfknssv gplysgcrli
14161 slrsekdgaa tgvdaicthh lnpqspgldr eqlywqlsqm tngikelgpy tldmslyvn
14221 gfthrssglt tstpwtstvd lgtsgtpspv pspttagpll vpftlnftit nlqyeedmhr
14281 pgsrkfnate rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk
14341 rpgldreqly welsqlthni telgpysldr dslyvngfth qsamtttrtp dtstmhlats 
14401 rtpaslsgpt taspllvlft xnctitnlqy eedmrrtgsr kfntmesvlq gllkplfknt 
14461 svgplysgcr Itllrpkkdg aatgvdaict hrldpkspgl nreqlywels kltndieelg 
14521 pytldmsly vngfthqssv sttstpgtst vdlrtsgtps slssptlmxx xpllaqpftxn

14581 xtitnlxxxx xin3aq)gsrkf nttervlggl Irplfkntsv sslysgcrlt llrpekdgaa 
14641 trvdaactyr pdpkspgldr eqlywelsql thsitelgpy tldrvslyvn gfnprssvpt 
14701 tstpgtstvh latsgtpssl pghtxxxpll ^ftxnxtit nlxxxxxmxx pgsrkfntte 
14761 rvlqgll]cpl fmssleyly sgcrlaslrp ekdssamavd aicthxpdpe dlgldrerly 

14821 welsziltngi qelgpytldr nslyvngfth rssglttstp wtstvdlgts gtpspvpspt

14881 tagpllvpft Inftitnlqy eedmhrpgsr rfnttervlq glltplfknt svgplysgcr 
14941 Itllrpekqe aatgvdtict hrvdpigpgl drerlywels qltnsitelg pytldrdsly 
15001 vngfnpwBBv pttstpgtst vhlatsgtps slpghtapvp llipftlnft itdlhyeenm 
15061 qhpgsrkfzit tervlqgllk plfkstsvgp lysgcrltll rpekhgaatg vdaictlrld 

15121 ptgpgldrer lywelsqltn svtelgpytl drdslyvngf thrssvptts ipgtsavhle

15181 tsgtpaslpg htapgpllvp ftlnftitnl qyeedmrhpg srkfstterv Iqgllkplfk 
15241 ntsvsBlysg crltllrpek dgaatrvdav cthrpdpksp gldrerlywk Isqlthgite 
15301 Igpytldrhs lyvngfthqs smtttrtpdt stmhlatsrt paslsgptta spllvlftin 
15361 ftitnlryee ntnhhpgsrkf nttervlqgl Irpvfkntsv gplysgcrlt tlrpkkdgaa 

15421 tkvdaictyr pdpkspgldr eqlywelsql thsitelgpy tqdrdslyvn gfthrssvpt

15481 tsipgtsavh letsgtpasl pghtapgpll vpftlnftit nlqyeedmrh pgsrkfntte 
15541 rvlqgllkpl fkstsvgply sgcrltllrp ekrgaatgvd ticthrldpl npgldreqly 
15601 welskltrgi ielgpylldr gslyvngfth rtsvpttstp gtstvdlgts gtpfslpspa 
15661 xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttervlq tllgpmfknt svgllysgcr 

15721 ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg pytldmsly

15781 vngfthwipv ptsstpgtst vdlgsgtpss Ipspttagpl Ivpftlnfti tnlkyeedmh 
15841 cpgsrkfntt ervlqsllgp mfkntsvgpl ysgcrltllr sekdgaatgv daicthrldp 
15901 kspgvdreql ywelsqltng ikelgpytld mslyvngft hqtsapntst pgtstvdlgt 
15961 sgtpsslpsp txxxpllxpf txnxtitnlx xxxxmxxpgs rkfnttexvl qgllxpxfkn 

16021 xsvgxlysgc rltxlrxekx gaatgxdaic xhxxxpkxpg lxxexlywel sxltxxixel

16081 gpytldrxsl yvngfthwip vptsstpgts tvdlgsgtps slpspttagp llvpftlnft 
16141 itnlkyeedm hcpgsrkfnt tervlqsllg prafkntsvgp lysgcrltsl rsekdgaatg 
16201 vdaicthrvd pkspgvdreq lywelsqltn gikelgpytl drnslyvngf thqtsapnts 
16261 tpgtstvxxg tsgtpssxpx xtsagpllvp ftlnftitnl qyeedmhhpg srkfntterv 

16321 lqgllgprafk ntsvgllysg crltllrpek ngattgmdai cthrlt^ksp glxxexlywe

16381 Isxltxxixe Igpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt pssxpxxtxx 
16441 xpllxpftxn xtitnlxxxx xmxxpgsrkf nttervlqgl Ikplfmssl cylysgcrla 
16501 slrpekdssa inavdaicthr pdpedlgldr erlywelsnl tngiqelgpy tldmslyvn 
16561 gfthrssmpt tstpgtstvd vgtsgtpsss pspttagpll ipftlnftit nlqygedtngh. 

16621 pgsrkfntte rvlqgllgpi fkntsvgply sgcrltslrs ekdgaatgvd aicihhldpk

16681 spglnrerly welsqltngi kelgpytldr nslyvngfth rtsvpttstp gtstvdlgts 
16741 gtpfslpspa tagpllvlft Inftitnlky eedmhrpgsr kfnttervlq tllgpmfknt 
16801 svgllysgcr Itllrsekdg aatgvdaict hrldpkspgl xxexlywels xltxxixelg 
16861 pytldrxsly vngfthxxsx pttstpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt

16921 itnlxxxxxm xxpgsrkfnt tervlqgllr pvfkntsvgp lysgcrltll rpkkdgaatk

16981 vdaictyrpd pkspgldreq lywelsqlth sitelgpytq drdslyvngf thrssvptts 
17041 ipgtsavhle ttgtpssfpg htepgpllip ftfnftitnl ryeenraqhpg srkfntterv 
17101 Iqglltplfk ntsvgplysg crltllrpek qeaatgvdti cthrvdpigp gldrerlywe 
17161 Isqltnsite Igpytldrds lyvdgfnpws svpttstpgt stvhlatsgt psplpghtap 

17221 vpllipftln ftitdlhyee nmqhpgsrkf nttervlqgl lkplfkstsv gplysgcrlt

17281 llrpekhgaa tgvdaictlr Idptgpgldr erlywelsql tnsitelgpy tldrdslyvn 
17341 gfnpwssvpt tstpgtstvh latsgtpssl pghttagpll vpftlnftit nllcyeednihc 
17401 pgsrkfntte rvlqslhgpm fkntsvgply sgcrltllrs ekdgaatgvd aicthrldpk 
17461 spglxxexly welsxltxxi xelgpytldr xslyvngfth xxsxpttstp gtstvxxgts 

17521 gtpssxpxxt xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxfknx

17581 svgxlysgcr Itxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltnsitelg 
17641 pytldrdsly vngfthrssm pttsipgtsa vhletsgtpa slpghtapgp llvpftlnft
17701 itnlqyeedm rhpgsrkfnt tervlqgllk plfkstsvgp lysgcrltll rpekrgaatg 
17761 vdticthrld plnpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 

17821 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxmxxpg srkfnttexv

17881 lqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe
17941 lsxltxxixe lgpytldrxs lyvngfhpra svpttstpgt stvhlatsgt psslpghtap
18001 vpllipftln ftitnlhyee nmqhpgsrkf nttervlqgl Igpmfkntsv gllysgcrlt 
18061 llrpekngaa tgmdaicshr Idpkapglxx exlywelsxl txxixelgpy tldrxslyvn 
18121 gfthxxsxpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxraxx 
18181 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk

18241 xpglxxexly welsxltxxi xelgpytldr xslyvngfth qnsvpttstp gtstvywatt 
18301 gtpssfpght epgpllipft fnftitnlhy eenmqhpgsr kfnttervlq glltplfknt 
18361 svgplysgcr Itllrpekqe aatgvdtict hrvdpigpgl xxexlywels xltxxixelg 
18421 pytldrxsly vngfthxxsx pttstpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt 
18481 itnlxxxxxm xxpgsrkfnt texvlqgllx pxfknxsvgx lysgcrltxl rxekxgaatg

18541 xdaicxhxxx pkxpglxxex lywelsxltx xixelgpytl drxslyvngf thrssvptts 
18601 spgtstvhla tsgtpsslpg htapvplllp ftlnftitnl hyeenmqhpg srkfntterv 
18661 Iqgllkplfk stsvgplysg crltllrpek hgaatgvdai ctlrldptgp glxxexlywe 
18721 Isxltxxixe Igpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt psBjq)xxtxx 
18781 xpllxpftxn xtltnlxxxx xmxxpgsrkf nttexvlqgl lxpxfknxsv gxXysgcrlt
18841 xlrxekxgaa tgxdaicxhx xxpkxpglxx exlywelsxl txxixelgpy tldrxslyvn 
18901 gfthrtsvpt tstpgtstvh latsgtpssl pghtapvpll ipftlnftit nlqyeedmhr 
18961 pgsrkfntte rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk 
19021 rpgldreqly celsqlthni telgpysldr dslyvngfth qnsvpttstp gtstvywatt 
19081 gtpssfpght xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxfknx 
19141 svgxlysgcr Itxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltxxixelg 
19201 pytldrxsly vngfthwssg Ittstpwtst vdlgtsgtps pvpspttagp llvpftlnft 
19261 itnlqyeedin hrpgsrkfna tervlqglls pifkntsvgp lysgcrltll rpekqeaatg 
19321 vdticthrvd pigpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 
19381 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxmxxpg srkfnttexv 
19441 Iqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe 
19501 Isxltxxixe Igpytldrxs lyvngfthrs fglttstpwt stvdlgtsgt pspvpsptta 
19561 gpllvpftln ftitnlqyee dmhrpgsrkf nttervlqgl Itplfmtsv sslysgcrlt 
19621 llrpekdgaa trvdavcthr pdpkspglxx exlywelsxl txxixelgpy tldrxslyvn 
19681 gfthxxsxpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxmxx 
19741 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk 
19801 xpglxxexly welsxltxxi xelgpytldr xslyvngfth wipvptsstp gtstvdlgsg 
19861 tpsslpsptt agpllvpftl nftitnlqyg edmghpgsrk fnttervlqg llgpifknts 
19921 vgplysgcrl tslrsekdga atgvdaicih hldpkspglx xexlywelsx Itxxixelgp 
19981 ytldrxslyv ngfthxxsxp ttstpgtstv xxgtsgtpss xpxxtxxxpl lxpftxnxti

20041 tnlxxxxxmx xpgsrkfntt exvlqgllxp xfknxsvgxl ysgcrltxlr xekxgaatgx 
20101 daicxhxxxp kxpglxxexl ywelsxltxx ixelgpytld rxslyvngft hqtfapntst
20161 pgtstvdlgt sgtpsslpsp tsagpllvpf tlnftitnlq yeedmhhpgs rkfnttervl 
20221 qgllgpmfkn tsvgllysgc rltllrpelcn gaatrvdavc thrpdpkspg Ixxexlywel 
20281 sxltxxixel gpytldrxsl yvngfthxxs xpttstpgts tvxxgtsgtp ssxpxxtapv

20341 pllipftlnf titnlhyeen mqhpgsrkfn ttervlqgll kplfkstsvg plysgcrltl
20401 lrpekhgaat gvdaictlrl dptgpgldre rlywelsqlt nsvtelgpyt ldrdslyvng
20461 ftqrssvptt sipgtsavhl etsgtpaslp ghtapgpllv pftlnftitn Iqyevdrarhp 
20521 gsrkfntter vlqgllkplf kstsvgplys gcrltllrpe krgaatgvdt icthrldpln
20581 pgldreqlyw elskltrgii elgpylldrg slyvngfthr nfvpitstpg tstvhlgtse

20641 tpsslprpiv pgpllvpftl nftitnlqye eamrhpgsrk fnttervlqg llrplfknts 
20701 igplysscrl tllrpekdka atrvdaicth hpdpqspgla reqlywelsq lthgitelgp
20761 ytldrdslyv dgfthwspip ttstpgtsiv nlgtsgipps lpettxx:^)! lxpftxnxti
20821 tnlxxxxxmx xpgsrkfntt ervlqgllkp Ifkstsvgpl ysgcrltllr pekdgvatrv 
20881 daicthrpdp kipgldrqql ywelsqlths itelgpytld rdslyvngft qrssvpttst 
20941 pgtftvqpet setpsslpgp tatgpvllpf tlnftitnlq yeedmhrpgs rkfnttervl 
21001 qgllmplfkn tsvsslysgc rltllrpekd gaatrvdavc thrpdpkspg Idrerlywkl 
21061 sqlthgitel gpytldrhsl yvngfthqss mtttrtpdts tmhlatsrtp aslsgpttas 
21121 pllvlftinf titnlryeen mhhpgsrkfn ttervlqgll rpvfkntsvg plysgcrltl 
21181 lrpkkdgaat kvdaictyrp dpkspgldre qlywelsqlt hsitelgpyt ldrdslyvng

21241 ftqrssvptt sipgtptvdl gtsgtpvskp gpsaaspllv Iftlnftitn Iryeenmqhp 
21301 gsrkfntter vlqgllrslf kstsvgplys gcrltllrpe kdgtatgvda icthhpdpks 
21361 prldreqlyw elsqlthnit elghyaldnd slfvngfthr ssvsttstpg tptvylgask 
21421 tpasifgpsa ashllilftl nftitnlrye enrawpgsrkf nttervlqgl Irplfkntsv 
21481 gplysgsrlt llrpekdgea tgvdaicthr pdptgpgldr eqlylelsql thsitelgpy 
21541 tldrdslyvn gfthrssvpt tstgwseep ftlnftinnl rymadmgqpg slkfnitdnv 



21601 mkhllsplfq rsslgarytg crvialrsvk ngaetrvdll ctylqplsgp glpikqvfhe
21661 lsqqthgitr lgpysldkds lylngynepg ldeppttpkp attflpplse attaragyhlk
21721 fcltlnftisn lqyspdmgkg satfnstegv lqhllrplfq kssmgpfylg cqlielrpek
21781 dgaatgvdtt ctyhpdpvgp gldiqqlywe lsqlthgvtq lgfyvldrds lfingyapqn
21841 lsirgeyqin fhlvnvnilsn pdptsseyit llrdiqdkvt tlykgsqlhd tfrfclvtnl
21901 tmdsvlvtvk alfssnldps lveqvfldkt lnasfhwlgs tyqlvdlhvt emessvyqpt
21961 sssstqhfyl nftltnlpys qdkaqpgttn yqmkmied alnqlfmss iksyf sdcc[v
22021 atfrsvpnrh htgvdslcnf splarrvdrv aiyeeflrmt rngtqlqnft ldrssvlvdg
22081 yspnmeplt gnsdlpfwav iliglagllg litclicgvl vttrrrlckeg eynvqqqcpg
22141 yyqshldled lq


SEQ ID NO. 2 
CA125 nucleic acid, GenBank No. AF414442
CDS 205..66663

1 aagcgttgca caattccccc aacctccata catacggcag ctcttctaga cacaggtttt 
61 cccaggtcaa atgcggggac cccagccata tctcccaccc tgagaaattt tggagtttca 
121 gggagctcag aagctctgca gaggccaccc tctctgaggg gattcttctt agacctccat 
181 ccagaggcaa atgttgacct gtccatgctg aaaccctcag gccttcctgg gtcatcttct

241 cccacccgct ccttgatgac agggagcagg agcactaaag ccacaccaga aatggattca 
301 ggactgacag gagccacctt gtcacctaag acatctacag gtgcaatcgt ggtgacagaa 
361 catactctgc cctttacttc cccagataag accttggcca gtcctacatc ttcggttgtg 
421 ggaagaacca cccagtcttt gggggtgatg tcctctgctc tccctgagtc aacctctaga 
481 ggaatgacac actccgagca aagaaccagc ccatcgctga gtccccaggt caatggaact

541 ccctctagga actaccctgc tacaagcatg gtttcaggat tgagttcccc aaggaccagg 
601 accagttcca cagaaggaaa ttttaccaaa gaagcatcta catacacact cactgtagag 
661 accacaagtg gcccagtcac tgagaagtac acagtcccca ctgagacctc aacaactgaa 
721 ggtgacagca cagagacccc ctgggacaca agatatattc ctgtaaaaat cacatctcca 
781 atgaaaacat ttgcagattc aactgcatcc aaggaaaatg ccccagtgtc tatgactcca

841 gctgagacca cagttactga ctcacatact ccaggaagga caaacccatc atttgggaca 
901 ctttattctt ccttccttga cctatcacct aaagggaccc caaattccag aggtgaaaca 
961 agcctggaac tgattctatc aaccactgga tatcccttct cctctcctga acctggctct 
1021 gcaggacaca gcagaataag taccagtgcg cctttgtcat catctgcttc agttctcgat 
1081 aataaaatat cagagaccag catattctca ggccagagtc tcacctcccc tctgtctcct

1141 ggggtgcccg aggccagagc cagcacaatg cccaactcag ctatcccttt ttccatgaca 
1201 ctaagcaatg cagaaacaag tgccgaaagg gtcagaagca caatttcctc tctggggact 
1261 ccatcaatat ccacaaagca gacagcagag actatcctta ccttccatgc cttcgctgag 
1321 accatggata tacccagcac ccacatagcc aagactttgg cttcagaatg gttgggaagt 
1381 ccaggtaccc ttggtggcac cagcacttca gcgctgacaa ccacatctcc atctaccact

1441 ttagtctcag aggagaccaa cacccatcac tccacgagtg gaaaggaaac agaaggaact 
1501 ttgaatacat ctatgactcc acttgagacc tctgctcctg gagaagagtc cgaaatgact 
1561 gccaccttgg tccccactct aggttttaca actcttgaca gcaagatcag aagtccatct 
1621 caggtctctt catcccaccc aacaagagag ctcagaacca caggcagcac ctctgggagg 
1681 cagagttcca gcacagctgc ccacgggagc tctgacatcc tgagggcaac cacttccagc

1741 acctcaaaag catcatcatg gaccagtgaa agcacagctc agcaatttag tgaaccccag 
1801 cacacacagt gggtggagac aagtcctagc atgaaaacag agagaccccc agcatcaacc 
1861 agtgtggcag cccctatcac cacttctgtt ccctcagtgg tctctggctt caccaccctg 
1921 aagaccagct ccacaaaagg gatttggctt gaagaaacat ctgcagacac actcatcgga 
1981 gaatccacag ctggcccaac cacccatcag tttgctgttc ccactgggat ttcaatgaca

2041 ggaggcagca gcaccagggg aagccagggc acaacccacc tactcaccag agccacagca 
2101 tcatctgaga catccgcaga tttgactctg gccacgaacg gtgtcccagt ctccgtgtct 
2161 ccagcagtga gcaagacggc tgctggctca agtcctccag gagggacaaa gccatcatat 
2221 acaatggttt cttctgtcat ccctgagaca tcatctctac agtcctcagc tttcagggaa 
2281 ggaaccagcc tgggactgac tccattaaac actagacatc ccttctcttc ccctgaacca

2341 gactctgcag gacacaccaa gataagcacc agcattcctc tgttgtcatc tgcttcagtt 
2401 cttgaggata aagtgtcagc gaccagcaca ttctcacacc acaaagccac ctcatctatt 
2461 accacaggga ctcctgaaat ctcaacaaag acaaagccca gctcagccgt tctttcctcc 
2521 atgaccctaa gcaatgcagc aacaagtcct gaaagagtca gaaatgcaac ttcccctctg
2581 actcatccat ctccatcagg ggaagagaca gcagggagtg tcctcactct cagcacctct
2641 gctgagacta cagactcacc taacatccac ccaactggga cactgacttc agaatcgtca 
2701 gagagtccta gcactctcag cctcccaagt gtctctggag tcaaaaccac attttcttca 
2761 tctactcctt ccactcatct atttactagt ggagaagaaa cagaggaaac ttcgaatcca 
2821 tctgtgtctc aacctgagac ttctgtttcc agagtaagga ccaccttggc cagcacctct

2881 gtccctaccc cagtattccc caccatggac acctggccta cacgttcagc tcagttctct 
2941 tcatcccacc tagtgagtga gctcagagct acgagcagta cctcagttac aaactcaact 
3001 ggttcagctc ttcctaaaat atctcacctc actgggacgg caacaatgtc acagaccaat 
3061 agagacacgt ttaatgactc tgctgcaccc caaagcacaa cttggccaga gactagtccc 
3121 agattcaaga cagggttacc ttcagcaaca accactgttt caacctctgc cacttctctc

3181 tctgctactg taatggtctc taaattcact tctccagcaa ctagttccat ggaagcaact 
3241 tctatcaggg aaccatcaac aaccatcctc acaacagaga ccacgaatgg cccaggctct 
3301 atggctgtgg cttctaccaa catcccaatt ggaaagggct acattactga aggaagattg 
3361 gacacaagcc atctgcccat tggaaccaca gcttcctctg agacatctat ggattttacc
3421 atggccaaag aaagtgtctc aatgtcagta tctccatctc agtccatgga tgctgctggc

3481 tcaagcactc caggaaggac aagccaattc gttgacacat tttctgatga tgtctatcat 
3541 ttaacatcca gagaaattac aatacctaga gatggaacaa gctcagctct gactccacaa
3601 atgactgcaa ctcaccctcc atctcctgat cctggctctg ctagaagcac ctggcttggc 
3661 atcttgtcct catctccttc ttctcctact cccaaagtca caatgagctc cacattttca 
3721 actcagagag tcaccacaag catgataatg gacacagttg aaactagtcg gtggaacatg

3781 cccaacttac cttccacgac ttccctgaca ccaagtaata ttccaacaag tggtgccata 
3841 ggaaaaagca ccctggttcc cttggacact ccatctccag ccacatcatt ggaggcatca 
3901 gaagggggac ttccaaccct cagcacctac cctgaatcaa caaacacacc cagcatccac 
3961 ctcggagcac acgctagttc agaaagtcca agcaccatca aacttaccat ggcttcagta 
4021 gtaaaacctg gctcttacac acctctcacc ttcccctcaa tagagaccca cattcatgta 
4081 tcaacagcca gaatggctta ctcttctggg tcttcacctg agatgacagc tcctggagag 
4141 actaacactg gtagtacctg ggaccccacc acctacatca ccactacgga tcctaaggat 
4201 acaagttcag ctcaggtctc tacaccccac tcagtgagga cactcagaac cacagaaaac 
4261 catccaaaga cagagtccgc caccccagct gcttactctg gaagtcctaa aatctcaagt 
4321 tcacccaatc tcaccagtcc ggccacaaaa gcatggacca tcacagacac aactgaacac 
4381 tccactcaat tacattacac aaaattggca gaaaaatcat ctggatttga gacacagtca 
4441 gctccaggac ctgtctctgt agtaatccct acctccccta ccattggaag cagcacattg 
4501 gaactaactt ctgatgtccc aggggaaccc ctggtccttg ctcccagtga gcagaccaca 
4561 atcactctcc ccatggcaac atggctgagt accagtttga cagaggaaat ggcttcaaca 
4621 gaccttgata tttcaagtcc aagttcaccc atgagtacat ttgctatttt tccacctatg

4681 tccacacctt ctcatgaact ttcaaagtca gaggcagata ccagtgccat tagaaataca 
4741 gattcaacaa cgttggatca gcacctagga atcaggagtt tgggcagaac tggggactta 
4801 acaactgttc ctatcacccc actgacaacc acgtggacca gtgtgattga acactcaaca 
4861 caagcacagg acaccctttc tgcaacgatg agtcctactc acgtgacaca gtcactcaaa 
4921 gatcaaacat ctataccagc ctcagcatcc ccttcccatc ttactgaagt ctaccctgag

4981 ctcgggacac aagggagaag ctcctctgag gcaaccactt tttggaaacc atctacagac 
5041 acactgtcca gagagattga gactggccca acaaacattc aatccactcc acccatggac 
5101 aacacaacaa cagggagcag tagtagtgga gtcaccctgg gcatagccca ccttcccata 
5161 ggaacatcct ccccagctga gacatccaca aacatggcac tggaaagaag aagttctaca 
5221 gccactgtct ctatggctgg gacaatggga ctccttgtta ctagtgctcc aggaagaagc

5281 atcagccagt cattaggaag agtttcctct gtcctttctg agtcaactac tgaaggagtc 
5341 acagattcta gtaagggaag cagcccaagg ctgaacacac agggaaatac agctctctcc 
5401 tcctctcttg aacccagcta tgctgaagga agccagatga gcacaagcat ccctctaacc 
5461 tcatctccta caactcctga tgtggaattc atagggggca gcacattttg gaccaaggag 
5521 gtcaccacag ttatgacctc agacatctcc aagtcttcag caaggacaga gtccagctca

5581 gctaccctta tgtccacagc tttgggaagc actgaaaata caggaaaaga aaaactcaga 
5641 actgcctcta tggatcttcc atctccaact ccatcaatgg aggtgacacc atggatttct
5701 ctcactctca gtaatgcccc caataccaca gattcacttg acctcagcca tggggtgcac 
5761 accagctctg cagggacttt ggccactgac aggtcattga atactggtgt cactagagcc 
5821 tccagattgg aaaacggctc tgatacctct tctaagtccc tgtctatggg aaacagcact

5881 cacacttcca tgactgacac agagaagagt gaagtgtctt cttcaatcca tccccgacct 
5941 gagacctcag ctcctggagc agagaccact ttgacttcca ctcctggaaa cagggccata 
6001 agcttaacat tgcctttttc atccattcca gtggaagaag tcatttctac aggcataacc 
6061 tcaggaccag acatcaactc agcacccatg acacattctc ccatcacccc accaacaatt 
6121 gtatggacca gtacaggcac aattgaacag tccactcaac cactacatgc agtttcttca

6181 gaaaaagttt ctgtgcagac acagtcaact ccatatgtca actctgtggc agtgtctgct
6241 tcccctaccc atgagaattc agtctcttct ggaagcagca catcctctcc atattcctca
6301 gcctcacttg aatccttgga ttccacaatc agtaggagga atgcaatcac ttcctggcta 
6361 tgggacctca ctacatctct ccccactaca acttggccaa gtactagttt atctgaggca 
6421 ctgtcctcag gccattctgg ggtttcaaac ccaagttcaa ctacgactga atttccactc 
6481 ttttcagctg catccacatc tgctgctaag caaagaaatc cagaaacaga gacccatggt

6541 ccccagaata cagccgcgag tactttgaac actgatgcat cctcggtcac aggtctttct 
6601 gagactcctg tgggggcaag tatcagctct gaagtccctc ttccaatggc cataacttct 
6661 agatcagatg tttctggcct tacatctgag agtactgcta acccgagttt aggcacagcc 
6721 tcttcagcag ggaccaaatt aactaggaca atatccctgc ccacttcaga gtctttggtt 
6781 tcctttagaa tgaacaagga tccatggaca gtgtcaatcc ctttggggtc ccatccaact

6841 actaatacag aaacaagcat cccagtaaac agcgcaggtc cacctggctt gtccacagta 
6901 gcatcagatg taattgacac accttcagat ggggctgaga gtattcccac tgtctccttt 
6961 tccccctccc ctgatactga agtgacaact atctcacatt tcccagaaaa gacaactcat 
7021 tcatttagaa ccatttcatc tctcactcat gagttgactt caagagtgac acctattcct 
7081 ggggattgga tgagttcagc tatgtctaca aagcccacag gagccagtcc ctccattaca

7141 ctgggagaga gaaggacaat cacctctgct gctccaacca cttcccccat agttctcact 
7201 gctagtttca cagagaccag cacagtttca ctggataatg aaactacagt aaaaacctca 
7261 gatatccttg acgcacggaa aacaaatgag ctcccctcag atagcagttc ttcttctgat 
7321 ctgatcaaca cctccatagc ttcttcaact atggatgtca ctaaaacagc ctccatcagt 
7381 cccactagca tctcaggaat gacagcaagt tcctccccat ctctcttctc ttcagataga

7441 ccccaggttc ccacatctac aacagagaca aatacagcca cctctccatc tgtttccagt 
7501 aacacctatt ctcttgatgg gggctccaat gtgggtggca ctccatccac tttaccaccc 
7561 tttacaatca cccaccctgt cgagacaagc tcggccctat tagcctggtc tagaccagta 
7621 agaactttca gcaccatggt cagcactgac actgcctccg gagaaaatcc tacctctagc 
7681 aattctgtgg tgacttctgt tccagcacca ggtacatggg ccagtgtagg cagtactact

7741 gacttacctg ccatgggctt tctcaagaca agtcctgcag gagaggcaca ctcacttcta 
7801 gcatcaacta ttgaaccagc cactgccttc actccccatc tctcagcagc agtggtcact 
7861 ggatccagtg ctacatcaga agccagtctt ctcactacga gtgaaagcaa agccattcat 
7921 tcttcaccac agaccccaac tacacccacc tctggagcaa actgggaaac ttcagctact 
7981 cctgagagcc ttttggtagt cactgagact tcagacacaa cacttacctc aaagattttg

8041 gtcacagata ccatcttgtt ttcaactgtg tccacgccac cttctaaatt tccaagtacg 
8101 gggactctgt ctggagcttc cttccctact ttactcccgg acactccagc catccctctc 
8161 actgccactg agccaacaag ttcattagct acatcctttg attccacccc actggtgact 
8221 atagcttcgg atagtcttgg cacagtccca gagactaccc tgaccatgtc agagacctca 
8281 aatggtgatg cactggttct taagacagta agtaacccag ataggagcat ccctggaatc

8341 actatccaag gagtaacaga aagtccactc catccttctt ccacttcccc ctctaagatt 
8401 gttgctccac ggaatacaac ctatgaaggt tcgatcacag tggcactttc tactttgcct 
8461 gcgggaacta ctggttccct tgtattcagt cagagttctg aaaactcaga gacaacggct 
8521 ttggtagact catcagctgg gcttgagagg gcatctgtga tgccactaac cacaggaagc 
8581 cagggtatgg ctagctctgg aggaatcaga agtgggtcca ctcactcaac tggaaccaaa

8641 acattttctt ctctccctct gaccatgaac ccaggtgagg ttacagccat gtctgaaatc 
8701 accacgaaca gactgacagc tactcaatca acagcaccca aagggatacc tgtgaagccc 
8761 accagtgctg agtcaggcct cctaacacct gtctctgcct cctcaagccc atcaaaggcc 
8821 tttgcctcac tgactacagc tcccccatca acttggggga tcccacagtc taccttgaca 
8881 tttgagtttt ctgaggtccc aagtttggat actaagtccg cttctttacc aactcctgga

8941 cagtccctga acaccattcc agactcagat gcaagcacag catcttcctc actgtccaag
9001 tctccagaaa aaaacccaag ggcaaggatg atgacttcca caaaggccat aagtgcaagc 
9061 tcatttcaat caacaggttt tactgaaacc cctgagggat ctgcctcccc ttctatggca 
9121 gggcatgaac ccagagtccc cacttcagga acaggggacc ctagatatgc ctcagagagc 
9181 atgtcttatc cagacccaag caaggcatca tcagctatga catcgacctc tcttgcatca

9241 aaactcacaa ctctcttcag cacaggtcaa gcagcaaggt ctggttctag ttcctctccc 
9301 ataagcctat ccactgagaa agaaacaagc ttcctttccc ccactgcatc cacctccaga 
9361 aagacttcac tatttcttgg gccttccatg gcaaggcagc ccaacatatt ggtgcatctt 
9421 cagacttcag ctctgacact ttctccaaca tccactctaa atatgtccca ggaggagcct 
9481 cctgagttaa cctcaagcca gaccattgca gaagaagagg gaacaacagc tgaaacacag

9541 acgttaacct tcacaccatc tgagacccca acatccttgt tacctgtctc ttctcccaca 
9601 gaacccacag ccagaagaaa gagttctcca gaaacatggg caagctctat ttcagttcct 
9661 gccaagacct ccttggttga aacaactgat ggaacgctag tgaccaccat aaagatgtca 
9721 agccaggcag cacaaggaaa ttccacgtgg cctgccccag cagaggagac ggggaccagt 
9781 ccagcaggca catccccagg aagcccagaa gtgtctacca ctctcaaaat catgagctcc 
9841 aaggaaccca gcatcagccc agagatcagg tccactgtgc gaaattctcc ttggaagact
9901 ccagaaacaa ctgttcccat ggagaccaca gtggaaccag tcacccttca gtccacagcc
9961 ctaggaagtg gcagcaccag catctctcac ctgcccacag gaaccacatc accaaccaag 
10021 tcaccaacag aaaatatgtt ggctacagaa agggtctccc tctccccatc cccacctgag 
10081 gcttggacca acctttattc tggaactcca ggagggacca ggcagtcact ggccacaatg
10141 tcctctgtct ccctagagtc accaactgct agaagcatca cagggactgg tcagcaaagc

10201 agtccagaac tggtttcaaa gacaactgga atggaattct ctatgtggca tggctctact 
10261 ggagggacca caggggacac acatgtctct ctgagcacat cttccaatat ccttgaagac 
10321 cctgtaacca gcccaaactc tgtgagctca ttgacagata aatccaaaca taaaaccgag 
10381 acatgggtaa gcaccacagc cattccctcc actgtcctga ataataagat aatggcagct 
10441 gaacaacaga caagtcgatc tgtggatgag gcttattcat caactagttc ttggtcagat

10501 cagacatctg ggagtgacat cacccttggt gcatctcctg atgtcacaaa cacattatac 
10561 atcacctcca cagcacaaac cacctcacta gtgtctctgc cctctggaga ccaaggcatt
10621 acaagcctca ccaatccctc aggaggaaaa acaagctctg cgtcatctgt cacatctcct 
10681 tcaatagggc ttgagactct gagggccaat gtaagtgcag tgaaaagtga cattgcccct 
10741 actgctgggc atctatctca gacttcatct cctgcggaag tgagcatcct ggacgtaacc

10801 acagctccta ctccaggtat ctccaccacc atcaccacca tgggaaccaa ctcaatctca 
10861 actaccacac ccaacccaga agtgggtatg agtaccatgg acagcacccc ggccacagag 
10921 aggcgcacaa cttctacaga acacccttcc acctggtctt ccacagctgc atcagattcc 
10981 tggactgtca cagacatgac ttcaaacttg aaagttgcaa gatctcctgg aacaatttcc 
11041 acaatgcata caacttcatt cttagcctca agcactgaat tagactccat gtctactccc

11101 catggccgta taactgtcat tggaaccagc ctggtcactc catcctctga tgcttcagct 
11161 gtaaagacag agaccagtac aagtgaaaga acattgagtc cttcagacac aactgcatct 
11221 actcccatct caactttttc tcgtgtccag aggatgagca tctcagttcc tgacatttta 
11281 agtacaagtt ggactcccag tagtacagaa gcagaagatg tgcctgtttc aatggtttct 
11341 acagatcatg ctagtacaaa gactgaccca aatacgcccc tgtccacttt tctgtttgat

11401 tctctgtcca ctcttgactg ggacactggg agatctctgt catcagccac agccactacc 
11461 tcagctcctc agggggccac aactccccag gaactcactt tggaaaccat gatcagccca 
11521 gctacctcac agttgccctt ctctataggg cacattacaa gtgcagtcac accagctgca 
11581 atggcaagga gctctggagt tactttttca agaccagatc ccacaagcaa aaaggcagag
11641 cagacttcca ctcagcttcc caccaccact tctgcacatc cagggcaggt gcccagatca 
11701 gcagcaacaa ctctggatgt gatcccacac acagcaaaaa ctccagatgc aacttttcag 
11761 agacaagggc agacagctct tacaacagag gcaagagcta catctgactc ctggaatgag 
11821 aaagaaaaat caaccccaag tgcaccttgg atcactgaga tgatgaattc tgtctcagaa 
11881 gataccatca aggaggttac cagctcctcc agtgtattaa aggaccctga atacgctgga 
11941 cataaacttg gaatctggga cgacttcatc cccaagtttg gaaaagcagc ccatatgaga

12001 gagttgcccc ttctgagtcc accacaggac aaagaggcaa ttcacccttc tacaaacaca 
12061 gtagagacca caggctgggt cacaagttcc gaacatgctt ctcattccac tatcccagcc 
12121 cactcagcgt catccaaact cacatctcca gtggttacaa cctccaccag ggaacaagca 
12181 atagtttcta tgtcaacaac cacatggcca gagtctacaa gggctagaac agagcctaat 
12241 tccttcttga ctattgaact gagggacgtc agcccttaca tggacaccag ctcaaccaca

12301 caaacaagta ttatctcttc cccaggttcc actgcgatca ccaaggggcc tagaacagaa 
12361 attacctcct ctaagagaat atccagctca ttccttgccc agtctatgag gtcgtcagac 
12421 agcccctcag aagccatcac caggctgtct aactttcctg ccatgacaga atctggagga 
12481 atgatccttg ctatgcaaac aagtccacct ggcgctacat cactaagtgc acctactttg 
12541 gatacatcag ccacagcctc ctggacaggg actccactgg ctacgactca gagatttaca

12601 tactcagaga agaccactct ctttagcaaa ggtcctgagg atacatcaca gccaagccct 
12661 ccctctgtgg aagaaaccag ctcttcctct tccctggtac ctatccatgc tacaacctcg
12721 ccttccaata ttttgttgac atcacaaggg cacagtccct cctctactcc acctgtgacc 
12781 tcagttttct tgtctgagac ctctggcctg gggaagacca cagacatgtc gaggataagc 
12841 ttggaacctg gcacaagttt acctcccaat ttgagcagta cagcaggtga ggcgttatcc

12901 acttatgaag cctccagaga tacaaaggca attcatcatt ctgcagacac agcagtgacg 
12961 aatatggagg caaccagttc tgaatattct cctatcccag gccatacaaa gccatccaaa 
13021 gccacatctc cattggttac ctcccacatc atgggggaca tcacttcttc cacatcagta 
13081 tttggctcct ccgagaccac agagattgag acagtgtcct ctgtgaacca gggacttcag 
13141 gagagaagca catcccaggt ggccagctct gctacagaga caagcactgt cattacccat

13201 gtgtctagtg gtgatgctac tactcatgtc accaagacac aagccacttt ctctagcgga 
13261 acatccatct caagccctca tcagtttata acttctacca acacatttac agatgtgagc 
13321 accaacccct ccacctctct gataatgaca gaatcttcag gagtgaccat caccacccaa 
13381 acaggtccta ctggagctgc aacacagggt ccatatctct tggacacatc aaccatgcct 
13441 tacttgacag agactccatt agctgtgact ccagatttta tgcaatcaga gaagaccact

13501 ctcataagca aaggtcccaa ggatgtgacc tggacaagcc ctccctctgt ggcagaaacc 
13561 agctatccct cttccctgac acctttcttg gtcacaacca tacctcctgc cacttccacg

13621 ttacaagggc aacatacatc ctctcctgtt tctgcgactt cagttcttac ctctggactg 

13681 gtgaagacca cagatatgtt gaacacaagc atggaacctg tgaccaattc acctcaaaat 

13741 ttgaacaatc catcaaatga gatactggcc actttggcag ccaccacaga tatagagact 

13801 attcatcctt ccataaacaa agcagtgacc aatatgggga ctgccagttc agcacatgta

13861 ctgcattcca ctctcccagt cagctcagaa ccatctacag ccacatctcc aatggttcct

13921 gcctccagca tgggggacgc tcttgcttct atatcaatac ctggttctga gaccacagac 

13981 attgagggag agccaacatc ctccctgact gctggacgaa aagagaacag caccctccag 

14041 gagatgaact caactacaga gtcaaacatc atcctctcca atgtgtctgt gggggctatt 

14101 actgaagcca caaaaatgga agtcccctct tttgatgcaa cattcatacc aactcctgct

14161 cagtcaacaa agttcccaga tattttctca gtagccagca gtagactttc aaactctcct 

14221 cccatgacaa tatctaccca catgaccacc acccagacag ggtcttctgg agctacatca 

14281 aagattccac ttgccttaga cacatcaacc ttggaaacct cagcagggac tccatcagtg 

14341 gtgactgagg ggtttgccca ctcaaaaata accactgcaa tgaacaatga tgtcaaggac 

14401 gtgtcacaga caaaccctcc ctttcaggat gaagccagct ctccctcttc tcaagcacct

14461 gtccttgtca caaccttacc ttcttctgtt gctttcacac cgcaatggca cagtacctcc 

14521 tctcctgttt ctatgtcctc agttcttact tcttcactgg taaagaccgc aggcaaggtg 

14581 gatacaagct tagaaacagt gaccagttca cctcaaagta tgagcaacac tttggatgac 

14641 atatcggtca cttcagcagc caccacagat atagagacaa cgcatccttc cataaacaca 

14701 gtagttacca atgtggggac caccggttca gcatttgaat cacattctac tgtctcagct

14761 tacccagagc catctaaagt cacatctcca aatgttacca cctccaccat ggaagacacc 

14821 acaatttccc gatcaatacc taaatcctct aagactacaa gaactgagac tgagacaact 

14881 tcctccctga ctcctaaact gagggagacc agcatctccc aggagatcac ctcgtccaca 

14941 gagacaagca ctgttcctta caaagagctc actggtgcca ctaccgaggt atccaggaca 

15001 gatgtcactt cctctagcag tacatccttc cctggccctg atcagtccac agtgtcacta

15061 gacatctcca cagaaaccaa caccaggctg tctacctccc caataatgac agaatctgca 

15121 gaaataacca tcaccaccca aacaggtcct catggggcta catcacagga tacttttacc 

15181 atggacccat caaatacaac cccccaggca gggatccact cagctatgac tcatggattt 

15241 tcacaattgg atgtgaccac tcttatgagc agaattccac aggatgtatc atggacaagt 

15301 cctccctctg tggataaaac cagctccccc tcttcctttc tgtcctcacc tgcaatgacc

15361 acaccttccc tgatttcttc taccttacca gaggataagc tctcctctcc tatgacttca 

15421 cttctcacct ctggcctagt gaagattaca gacatattac gtacacgctt ggaacctgtg 

15481 accagctcac ttccaaattt cagcagcacc tcagataaga tactggccac ttctaaagac 

15541 agtaaagaca caaaggaaat ttttccttct ataaacacag aagagaccaa tgtgaaagcc 

15601 aacaactctg gacatgaatc ccattcccct gcactggctg actcagagac acccaaagcc

15661 acaactcaaa tggttatcac caccactgtg ggagatccag ctccttccac atcaatgcca 

15721 gtgcatggtt cctctgagac tacaaacatt aagagagagc caacatattt cttgactcct 

15781 agactgagag agaccagtac ctctcaggag tccagctttc ccacggacac aagttttcta 

15841 ctttccaaag tccccactgg tactattact gaggtctcca gtacaggggt caactcttct 

15901 agcaaaattt ccaccccaga ccatgataag tccacagtgc cacctgacac cttcacagga

15961 gagatcccca gggtcttcac ctcctctatt aagacaaaat ctgcagaaat gacgatcacc 

16021 acccaagcaa gtcctcctga gtctgcatcg cacagtaccc ttcccttgga cacatcaacc 

16081 acactttccc agggagggac tcattcaact gtgactcagg gattcccata ctcagaggtg 

16141 accactctca tgggcatggg tcctgggaat gtgtcatgga tgacaactcc ccctgtggaa 

16201 gaaaccagct ctgtgtcttc cctgatgtct tcacctgcca tgacatcccc ttctcctgtt

16261 tcctccacat caccacagag catcccctcc tctcctcttc ctgtgactgc acttcctact 

16321 tctgttctgg tgacaaccac agatgtgttg ggcacaacaa gcccagagtc tgtaaccagt 

16381 tcacctccaa atttgagcag catcactcat gagagaccgg ccacttacaa agacactgca 

16441 cacacagaag ccgccatgca tcattccaca aacaccgcag tgaccaatgt agggacttcc 

16501 gggtctggac ataaatcaca abcctctgtc ctagctgact cagagacatc gaaagccaca

16561 cctctgatga gtaccacctc caccctgggg gacacaagtg tttccacatc aactcctaat 

16621 atctctcaga ctaaccaaat tcaaacagag ccaacagcat ccctgagccc tagactgagg 

16681 gagagcagca cgtctgagaa gaccagctca acaacagaga caaatactgc cttttcttat 

16741 gtgcccacag gtgctattac tcaggcctcc agaacagaaa tctcctctag cagaacatcc 

16801 atctcagacc ttgatcggcc cacaatagca cccgacatct ccacaggaat gatcaccagg

16861 Gtcttcacct cccccatcat gacaaaatct gcagaaatga ccgtcaccac tcaaacaact 

16921 actcctgggg ctacatcaca gggtatcctt ccttgggaca catcaaccac acttttccag 

16981 ggagggactc attcaaccgt gtctcaggga ttcccacact cagagataac cactcttcgg 

17041 agcagaaccc ctggagatgt gtcatggatg acaactcccc ctgtggaaga aaccagctct 

17101 gggttttccc tgatgtcacc ttccatgaca tccccttctc ctgtttcctc cacatcacca

17161 gagagcatcc cctcctctcc tctccctgtg actgcacttc ttacttctgt tctggtgaca 
17221 accaccaatg tattgggcac aacaagccca gagaccgtaa cgagttcacc tccaaattta
17281 agcagcccca cacaggagag actgaccact tacaaagaca ctgcgcacac agaagccatg 
17341 catgcttcca tgcatacaaa cactgcagtg gccaacgtcg ggacctccat ttctggacat 
17401 gaatcacaat cttctgtccc agctgattca cacacatcca aagccacatc tccaatgggt 
17461 atcaccttcg ccatggggga tacaagtgtt tctacatcaa ctcctgcctt ctttgagact

17521 agaattcaga ctgaatcaac atcctctttg attcctggat taagggacac caggacgtct 
17581 gaggagatca acactgtgac agagaccagc actgtccttt cagaagtgcc cactactact
17641 actactgagg tctccaggac agaagttatc acttccagca gaaceiaccat ctcagggcct
17701 gatcattcca aaatgtcacc ctacatctcc acagaaacca tcaccaggct ctccactttt 
17761 ccttttgtaa caggatccac agaaatggcc atcaccaacc aaacaggtcc tatagggact

17821 atctcacagg ctacccttac cctggacaca tcaagcacag cttcctggga agggactcac 
17881 tcacctgtga ctcagagatt tccacactca gaggagacca ctactatgag cagaagtact 
17941 aagggcgtgt catggcaaag ccctccctct gtggaagaaa ccagttctcc ttcttcccca 
18001 gtgcctttac ctgcaataac ctcacattca tctctttatt ccgcagtatc aggaagtagc
18061 cccacttctg ctctccctgt gacttccctt ctcacctctg gcaggaggaa gaccatagac

18121 atgttggaca cacactcaga acttgtgacc agctccttac caagtgcaag tagcttctca 
18181 ggtgagatac tcacttctga agcctccaca aatacagaga caattcactt ttcagagaac 
18241 acagcagaaa ccaatatggg gaccaccaat tctatgcata aactacattc ctctgtctca 
18301 atccactccc agccatccgg acacacacct ccaaaggtta ctggatctat gatggaggac 
18361 gctattgttt ccacatcaac acctggttct cctgagacta aaaatgttga cagagactca

18421 acatcccctc tgactcctga actgaaagag gacagcaccg ccctggtgat gaactcaact 
18481 acagagtcaa acactgtttt ctccagtgtg tccctggatg ctgctactga ggtctccagg 
18541 gcagaagtca cctactatga tcctacattc atgccagctt ctgctcagtc aacaaagtcc 
18601 ccagacattt cacctgaagc cagcagcagt cattctaact ctcctccctt gacaatatct 
18661 acacacaaga ccatcgccac acaaacaggt ccttctgggg tgacatctct tggccaactg

18721 accctggaca catcaaccat agccacctca gcaggaactc catcagccag aactcaggat 
18781 tttgtagatt cagaaacaac cagtgtcatg aacaatgatc tcaatgatgt gttgaagaca 
18841 agccctttct ctgcagaaga agccaactct ctctcttctc aggcacctct ccttgtgaca 
18901 acctcacctt ctcctgtaac ttccacattg caagagcaca gtacctcctc tcttgtttct 
18961 gtgacctcag tacccacccc tacactggcg aagatcacag acatggacac aaacttagaa

19021 cctgtgactc gttcacctca aaatttaagg aacaccttgg ccacttcaga agccaccaca 
19081 gatacacaca caatgcatcc ttctataaac acagcaatgg ccaatgtggg gaccaccagt 
19141 tcaccaaatg aattctattt tactgtctca cctgactcag acccatataa agccacatcc 
19201 gcagtagtta tcacttccac ctcgggggac tcaatagttt ccacatcaat gcctagatcc 
19261 tctgcgatga aaaagattga gtctgagaca actttctccc tgatatttag actgagggag

19321 actagcacct cccagaaaat tggctcatcc tcagacacaa gcacggtctt tgacaaagca 
19381 ttcactgctg ctactactga ggtctccaga acagaactca cctcctctag cagaacatcc
19441 atccaaggca ctgaaaagcc cacaatgtca ccggacacct ccacaagatc tgtcaccatg 
19501 ctttctactt ttgctggcct gacaaaatcc gaagaaagga ccattgccac ccaaacaggt 
19561 cctcataggg cgacatcaca gggtaccctt acctgggaca catcaatcac aacctcacag

19621 gcagggaccc actcagctat gactcatgga ttttcacaat tagatttgtc cactcttacg 
19681 agtagagttc ctgagtacat atcagggaca agcccaccct ctgtggaaaa aaccagctct 
19741 tcctcttccc ttctgtcttt accagcaata acctcaccgt cccctgtacc tactacatta 
19801 ccagaaagta ggccgtcttc tcctgttcat ctgacttcac tccccacctc tggcctagtg 
19861 aagaccacag atatgctggc atctgtggcc agtttacctc caaacttggg cagcacctca

19921 cataagatac cgactacttc agaagacatt aaagatacag agaaaatgta tccttccaca 
19981 aacatagcag taaccaatgt ggggaccacc acttctgaaa aggaatctta ttcgtctgtc 
20041 ccagcctact cagaaccacc caaagtcacc tctccaatgg ttacctcttt caacataagg 
20101 gacaccattg tttccacatc catgcctggc tcctctgaga ttacaaggat tgagatggag
20161 tcaacattct ccgtggctca tgggctgaag ggaaccagca cctcccagga ccccatcgta

20221 tccacagaga aaagtgctgt ccttcacaag ttgaccactg gtgctactga gacctctagg
20281 acagaagttg cctcttctag aagaacatcc attccaggcc ctgatcattc cacagagtca 
20341 ccagacatct ccactgaagt gatccccagc ctgcctatct cccttggcat tacagaatct 
20401 tcaaatatga ccatcatcac tcgaacaggt cctcctcttg gctctacatc acagggcaca 
20461 tttaccttgg acacaccaac tacatcctcc agggcaggaa cacactcgat ggcgactcag

20521 gaatttccac actcagaaat gaccactgtc atgaacaagg accctgagat tctatcatgg 
20581 acaatccctc cttctataga gaaaaccagc ttctcctctt ccctgatgcc ttcaccagcc 
20641 atgacttcac ctcctgtttc ctcaacatta ccaaagacca ttcacaccac tccttctcct 
20701 atgacctcac tgctcacccc tagcctagtg atgaccacag acacattggg cacaagccca 
20761 gaacctacaa ccagttcacc tccaaatttg agcagtacct cacatgtgat actgacaaca 
20821 gatgaagaca ccacagctat agaagccatg catccttcca caagcacagc agcgactaat 
20881 gtggaaacca cctgttctgg acatgggtca caatcctctg tcctaactga ctcagaaaaa
20941 accaaggcca cagctccaat ggataccacc tccaccatgg ggcatacaac tgtttccaca 
21001 tcaatgtctg tttcctctga gactacaaaa attaagagag agtcaacata ttccttgact 
21061 cctggactga gagagaccag catttcccaa aatgccagct tttccactga cacaagtatt 
21121 gttctttcag aagtccccac tggtactact gctgaggtct ccaggacaga agtcacctcc 
21181 tctggtagaa catccatccc tggcccttct cagtccacag ttttgccaga aatatccaca 
21241 agaacaatga caaggctctt tgcctcgccc accatgacag aatcagcaga aatgaccatc 
21301 cccactcaaa caggtccttc tgggtctacc tcacaggata cccttacctt ggacacatcc 
21361 accacaaagt cccaggcaaa gactcattca actttgactc agagatttcc acactcagag 
21421 atgaccactc tcatgagcag aggtcctgga gatatgtcat ggcaaagctc tccctctctg 
21481 gaaaatccca gctctctccc ttccctgctg tctttacctg ccacaacctc acctcctccc 
21541 atttcctcca cattaccagt gactatctcc tcctctcctc ttcctgtgac ttcacttctc 
21601 acctctagcc cggtaacgac cacagacatg ttacacacaa gcccagaact tgtaaccagt 
21661 tcacctccaa agctgagcca cacttcagat gagagactga ccactggcaa ggacaccaca 
21721 aatacagaag ctgtgcatcc ttccacaaac acagcagcgt ccaatgtgga gattcccagc 
21781 tttggacatg aatccccttc ctctgcctta gctgactcag agacatccaa agccacatca 
21841 ccaatgttta ttacctccac ccaggaggat acaactgttg ccatatcaac ccctcacttc 
21901 ttggagacta gcagaattca gaaagagtca atttcctccc tgagccctaa attgagggag 
21961 acaggcagtt ctgtggagac aagctcagcc atagagacaa gtgctgtcct ttctgaagtg 
22021 tccattggtg ctactactga gatctccagg acagaagtca cctcctctag cagaacatcc

22081 atctctggtt ctgctgagtc cacaatgttg ccagaaatat ccaccacaag aaaaatcatt 
22141 aagttcccta cttcccccat cctggcagaa tcatcagaaa tgaccatcaa gacccaaaca 
22201 agtcctcctg ggtctacatc agagagtacc tttacattag acacatcaac cactccctcc 
22261 ttggtaataa cccattcgac tatgactcag agattgccac actcagagat aaccactctt 
22321 gtgagtagag gtgctgggga tgtgccacgg cccagctctc tccctgtgga agaaacaagc

22381 cctccatctt cccagctgtc tttatctgcc atgatctcac cttctcctgt ttcttccaca 
22441 ttaccagcaa gtagccactc ctcttctgct tctgtgactt cacctctcac accaggccaa 
22501 gtgaagacta ctgaggtgtt ggacgcaagt gcagaacctg aaaccagttc acctccaagt 
22561 ttgagcagca cctcagttga aatactggcc acctctgaag tcaccacaga tacggagaaa 
22621 attcatcctt tcccaaacac ggcagtaacc aaagttggaa cttccagttc tggacatgaa

22681 tccccttcct ctgtcctacc tgactcagag acaaccaaag ccacatcggc aatgggtacc 
22741 atctccatta tgggggatac aagtgtttct acattaactc ctgccttatc taacactagg 
22801 aaaattcagt cagagccagc ttcctcactg accaccagat tgagggagac cagcacctct
22861 gaagagacca gcttagccac agaagcaaac actgttcttt ctaaagtgtc cactggtgct 
22921 actactgagg tctccaggac agaagccatc tcctttagca gaacatccat gtcaggccct

22981 gagcagtcca caatgtcaca agacatctcc ataggaacca tccccaggat ttctgcctcc 
23041 tctgtcctga cagaatctgc aaaaatgacc atcacaaccc aaacaggtcc ttcggagtct 
23101 acactagaaa gtacccttaa tttgaacaca gcaaccacac cctcttgggt ggaaacccac 
23161 tctatagtaa ttcagggatt tccacaccca gagatgacca cttccatggg cagaggtcct 
23221 ggaggtgtgt catggcctag ccctcccttt gtgaaagaaa ccagccctcc atcctccccg

23281 ctgtctttac ctgccgtgac ctcacctcat cctgtttcca ccacattcct agcacatatc 
23341 cccccctctc cccttcctgt gacttcactt ctcacctctg gcccggcgac aaccacagat 
23401 atcttgggta caagcacaga acctggaacc agttcatctt caagtttgag caccacctcc 
23461 catgagagac tgaccactta caaagacact gcacatacag aagccgtgca tccttccaca 
23521 aacacaggag ggaccaatgt ggcaaccacc agctctggat ataaatcaca gtcctctgtc

23581 ctagctgact catctccaat gtgtaccacc tccaccatgg gggatacaag tgttctcaca 
23641 tcaactcctg ccttccttga gactaggagg attcagacag agctagcttc ctccctgacc
23701 cctggattga gggagtccag tggctctgaa gggaccagct caggcaccaa gatgagcact 
23761 gtcctctcta aagtgcccac tggtgctact actgagatct ccaaggaaga cgtcacctcc
23821 atcccaggtc ccgctcaatc cacaatatca ccagacatct ccacaagaac cgtcagctgg

23881 ttctctacat cccctgtcat gacagaatca gcagaaataa ccatgaacac ccatacaagt 
23941 cctttagggg ccacaacaca aggcaccagt actttggcca cgtcaagcac aacctctttg 
24001 acaatgacac actcaactat atctcaagga ttttcacact cacagatgag cactcttatg
24061 aggaggggtc ctgaggatgt atcatggatg agccctcccc ttctggaaaa aactagacct 
24121 tccttttctc tgatgtcttc accagccaca acttcacctt ctcctgtttc ctccacatta

24181 ccagagagca tctcttcctc tcctcttcct gtgacttcac tcctcacgtc tggcttggca 
24241 aaaactacag atatgttgca caaaagctca gaacctgtaa ccaactcacc tgcaaatttg 
24301 agcagcacct cagttgaaat actggccacc tctgaagtca ccacagatac agagaaaact 
24361 catccttctt caaacagaac agtgaccgat gtggggacct ccagttctgg acatgaatcc 
24421 acttcctttg tcctagctga ctcacagaca tccaaagtca catctccaat ggttattacc

24481 tccaccatgg aggatacgag tgtctccaca tcaactcctg gcttttttga gactagcaga 
24541 attcagacag aaccaacatc ctccctgacc cttggactga gaaagaccag cagctctgag
24601 gggaccagct tagccacaga gatgagcact gtcctttctg gagtgcccac tggtgccact
24661 gctgaagtct ccaggacaga agtcacctcc tctagcagaa catccatctc aggctttgct
24721 cagctcacag tgtcaccaga gacttccaca gaaaccatca ccagactccc tacctccagc
24781 ataatgacag aatcagcaga aatgatgatc aagacacaaa cagatcctcc tgggtctaca
24841 ccagagagta ctcatactgt ggacatatca acaacaccca actgggtaga aacccactcg
24901 actgtgactc agagattttc acactcagag atgaccactc ttgtgagcag aagccctggt
24961 gatatgttat ggcctagtca atcctctgtg gaagaaacca gctctgcctc ttccctgctg
25021 tctctgcctg ccacgacctc accttctcct gtttcctcta cattagtaga ggatttccct
25081 tccgcttctc ttcctgtgac ttctcttctc acccctggcc tggtgataac cacagacagg
25141 atgggcataa gcagagaacc tggaaccagt tccacttcaa atttgagcag cacctcccat
25201 gagagactga ccactttgga agacactgta gatacagaag acatgcagcc ttccacacac
25261 acagcagtga ccaacgtgag gacctccatt tctggacatg aatcacaatc ttctgtccta
25321 tctgactcag agacacccaa agccacatct ccaatgggta ccacctacac catgggggaa
25381 acgagtgttt ccatatccac ttctgacttc tttgagacca gcagaattca gatagaacca
25441 acatcctccc tgacttctgg attgagggag accagcagct ctgagaggat cagctcagcc
25501 acagagggaa gcactgtcct ttctgaagtg cccagtggtg ctaccactga ggtctccagg
25561 acagaagtga tatcctctag gggaacatcc atgtcagggc ctgatcagtt caccatatca
25621 ccagacatct ctactgaagc gatcaccagg ctttctactt cccccattat gacagaatca
25681 gcagaaagtg ccatcactat tgagacaggt tctcctgggg ctacatcaga gggtacGctc
25741 accttggaca cctcaacaac aaccttttgg tcagggaccc actcaactgc atctccagga
25801 ttttcacact cagagatgac cactcttatg agtagaactc ctggagatgt gccatggccg
25861 agccttccct ctgtggaaga agccagctct gtctcttcct cactgtcttc acctgccatg
25921 acctcaactt cttttttctc cgcattacca gagagcatct cctcctctcc tcatcctgtg
25981 actgcacttc tcacccttgg cccagtgaag accacagaca tgttgcgcac aagctcagaa
26041 cctgaaacca gttcacctcc aaatttgagc agcacctcag ctgaaatatt agccacgtct
26101 gaagtcacca aagatagaga gaaaattcat ccctcctcaa acacacctgt agtcaatgta
26161 gggactgtga tttataaaca tctatcccct tcctctgttt tggctgactt agtgacaaca
26221 aaacccacat ctccaatggc taccacctcc actctgggga atacaagtgt ttccacatca
26281 actcctgcct tcccagaaac tatgatgaca cagccaactt cctccctgac ttctggatta
26341 agggagatca gtacctctca agagaccagc tcagcaacag agagaagtgc ttctctttct
26401 ggaatgccca ctggtgctac tactaaggtc tccagaacag aagccctctc cttaggcaga
26461 acatccaccc caggtcctgc tcaatccaca atatcaccag aaatctccac ggaaaccatc
26521 actagaattt ctactcccct caccacgaca ggatcagcag aaatgaccat cacccccaaa
26581 acaggtcatt ctggggcatc ctcacaaggt acctttacct tggacacatc aagcagagcc
26641 tcctggccag gaactcactc agctgcaact cacagatctc cacactcagg gatgaccact
26701 cctatgagca gaggtcctga ggatgtgtca tggccaagcc gcccatcagt ggaaaaaact
26761 agccctccat cttccctggt gtctttatct gcagtaacct caccttcgcc actttattcc
26821 aqaccatctg agagtagcca ctcatctcct ctccgggtga cttctctttt cacccctgtc
26881 atgatgaaga ccacagacat gttggacaca agcttggaac ctgtgaccac ttcacctccc
26941 agtatgaata tcacctcaga tgagagtctg gccacttcta aagccaccat ggagacagag
27001 gcaattcagc tttcagaaaa cacagctgtg actcagatgg gcaccatcag cgctagacaa
27061 gaattctatt cctcttatcc aggcctccca gagccatcca aagtgacatc tccagtggtc
27121 acctcttcca ccataaiaaga cattgtttct acaaccatac ctgcttcctc tgagataaca
27181 agaattgaga t^S9a9tcaac atccaccctg acccccacac caagggagac cagcacctcc
27241 caggagatcc actcagccac aaagccaagc actgttcctt acaaggcact cactagtgcc
27301 acgattgagg actccatgac acaagtcatg tcctctagca gaggacctag ccctgatcag
27361 tccacaatgt cacaagacat atccagtgaa gtgatcacca ggctctctac ctcccccatc
27421 aaggcagaat ctacagaaat gaccattacc acccaaacag gttctcctgg ggctacatca
27481 .......... ttaccttgga cacttcaaca acttttatgt cagggaccca ctcaactgca
27541 tctcaaggat tttcacactc acagatgacc gctcttatga gtagaactcc tggagatgtg
27601 ccatggctaa gccabccctc tgtggaagaa gccagctctg cctctttctc actgtcttca
27661 cctgtcatga cctcatcttc tcccgtttct tccacattac cagacagcat ccactcttct
27721 tcgcttcctg tgacatcact tctcacctca gggctggtga agaccacaga gctgttgggc
27781 acaagctcag aacctgaaac cagttcaccc ccaaatttga gcagcacctc agctgaaata
27841 ctggccacca ctgaagtcac tacagataca gagaaactgg agatgaccaa tgtggtaacc
27901 tcaggttata cacatgaatc tccttcctct gtcctagctg actcagtgac aacaaaggcc
27961 acatcttcaa tgggtatcac ctaccccaca ggagatacaa atgttctcac atcaacccct
28021 gccttctctg acaccagtag gattcaaaca aagtcaaagc tctcactgac tcctgggttg
28081 atggagacca gcatctctga agagaccagc tctgccacag aaaaaagcac tgtcctttct
28141 agtgtgccca ctggtgctac tactgaggtc tccaggacag aagccatctc ttctagcaga
28201 acatccatcc caggccctgc tcaatccaca atgtcatcag acacctccat ggaaaccatc
28261 actagaattt ctacccccct cacaaggaaa gaatcaacag acatggccat cacccccaaa 
28321 acaggtcctt ctggggctac ctcgcagggt acctttacct tggactcatc aagcacagcc
28381 tcctggccag gaactcactc agctacaact cagagatttc cacagtcagt ggtgacaact 
28441 cctatgagca gaggtcctga ggatgtgtca tggccaagcc cgctgtctgt ggaaaaaaac

28501 agccctccat cttccctggt atcttcatct tcagtaacct caccttcgcc actttattcc
28561 acaccatctg ggagtagcca ctcctctcct gtccctgtca cttctctttt cacctctatc 
28621 atgatgaagg ccacagacat gttggatgca agtttggaac ctgagaccac ttcagctccc 
28681 aatatgaata tcacctcaga tgagagtctg gccacttcta aagccaccac ggagacagag 

28741 gcaattcacg tttttgaaaa tacagcagcg tcccatgtgg aaaccaccag tgctacagag

28801 gaactctatt cctcttcccc aggcttctca gagccaacaa aagtgatatc tccagtggtc 
28861 acctcttcct ctataagaga caacatggtt tccacaacaa tgcctggctc ctctggcatt 
28921 acaaggattg agatagagtc aatgtcatct ctgacccctg gactgaggga gaccagaacc 
28981 tcccaggaca tcacctcatc cacagagaca agcactgtcc tttacaagat gtcctctggt 

29041 gccactcctg aggtctccag gacagaagtt atgccctcta gcagaacatc cattcctggc

29101 cctgctcagt ccacaatgtc actagacatc tccgatgaag ttgtcaccag gctgtctacc 
29161 tctcccatca tgacagaatc tgcagaaata accatcacca cccaaacagg ttattctctg 
29221 gctacatccc aggttaccct tcccttgggc acctcaatga cctttttgtc agggacccac 
29281 tcaactatgt ctcaaggact ttcacactca gagatgacca atcttatgag caggggtcct 

29341 gaaagtctgt catggacgag ccctcgcttt gtggaaacaa ctagatcttc ctcttctctg

29401 acatcattac ctctcacgac ctcactttct cctgtgtcct ccacattact agacagtagc 
29461 ccctcctctc ctcttcctgt gacttcactt atcctcccag gcctggtgaa gactacagaa 
29521 gtgttggata caagctcaga gcctaaaacc agttcatctc caaatttgag cagcacctca 
29581 gttgaaatac cggccacctc tgaaatcatg acagatacag agaaaattca tccttcctca 

29641 aacacagcgg tggccaaagt gaggacctcc agttctgttc atgaatctca ttcctctgtc

29701 ctagctgact cagaaacaac cataaccata ccttcaatgg gtatcacctc cgctgtggac 
29761 gataccactg ttttcacatc aaatcctgcc ttctctgaga ctaggaggat tccgacagag 
29821 ccaacattct cattgactcc tggattcagg gagactagca cctctgaaga gaccacctca 
29881 atcacagaaa caagtgcagt cctttatgga gtgcccacta gtgctactac tgaagtctcc 

29941 atgacagaaa tcatgtcctc taatagaaca cacatccctg actctgatca gtccacgatg

30001 tctccagaca tcatcactga agtgatcacc aggctctctt cctcatccat gatgtcagaa 

30061 tcaacacaaa tgaccatcac cacccaaaaa agttctcctg gggctacagc acagagtact
30121 cttaccttgg ccacaacaac agcccccttg gcaaggaccc actcaactgt tcctcctaga
30181 tttttacact cagagatgac aactcttatg agtaggagtc ctgaaaatcc atcatggaag

30241 agctctccct ttgtggaaaa aactagctct tcatcttctc tgttgtcctt acctgtcacg

30301 acctcacctt ctgtttcttc cacattaccg cagagtatcc cttcctcctc tttttctgtg 
30361 acttcactcc tcaccccagg catggtgaag actacagaca caagcacaga acctggaacc 
30421 agtttatctc caaatctgag tggcacctca gttgaaatac tggctgcctc tgaagtcacc
30481 acagatacag agaaaattca tccttcttca agcatggcag tgaccaatgt gggaaccacc 

30541 agttctggac atgaactata ttcctctgtt tcaatccact cggagccatc caaggctaca

30601 tacccagtgg gtactccctc ttccatggct gaaacctcta tttccacatc aatgcctgct 
30661 aattttgaga ccacaggatt tgaggctgag ccattttctc atttgacttc tggatttagg
30721 aagacaaaca tgtccctgga caccagctca gtcacaccaa caaatacacc ttcttctcct 
30781 gggtccactc accttttaca gagttccaag actgatttca cctcttctgc aaaaacatca 

30841 tccccagact ggcctccagc ctcacagtat actgaaattc cagtggacat aatcaccccc

30901 tttaatgctt ctccatctat tacggagtcc actgggataa cctccttccc agaatccagg 
30961 tttactatgt ctgtaacaga aagtactcat catctgagta cagatttgct gccttcagct 
31021 gagactattt ccactggcac agtgatgcct tctctatcag aggccatgac ttcatttgcc 
31081 accactggag ttccacgagc catctcaggt tcaggtagtc cattctctag gacagagtca 

31141 ggccctgggg atgctactct gtccaccatt gcagagagcc tgccttcatc cactcctgtg

31201 ccattctcct cttcaacctt cactaccact gattcttcaa ccatcccagc cctccatgag 
31261 ataacttcct cttcagctac cccatataga gtggacacca gtcttgggac agagagcagc 
31321 actactgaag gacgcttggt tatggtcagt actttggaca cttcaagcca accaggcagg 
31381 acatcttcaa cacccatttt ggataccaga atgacagaga gcgttgagct gggaacagtg 

31441 acaagtgctt atcaagttcc ttcactctca acacggttga caagaactga tggcattatg

31501 gaacacatca caaaaatacc caatgaagca gcacacagag gtaccataag accagtcaaa 
31561 ggccctcaga catccacttc gcctgccagt cctaaaggac tacacacagg agggacaaaa 
31621 agaatggaga ccaccaccac agctttgaag accaccacca cagctttgaa gaccacttcc 
31681 agagccacct tgaccaccag tgtctatact cccactttgg gaacactgac tcccctcaat 

31741 gcatcaaggc aaatggccag cacaatcctc acagaaatga tgatcacaac cccatatgtt

31801 ttccctgatg ttccagaaac gacatcctca ttggctacca gcctgggagc agaaaccagc 
31861 acagctcttc ccaggacaac cccatctgtt ctcaatagag aatcagagac cacagcctca
31921 ctggtctctc gttctggggc agagagaagt ccggttattc aaactctaga tgtttcttct 
31981 agtgagccag atacaacagc ttcatgggtt atccatcctg cagagaccat cccaactgtt 
32041 tccaagacaa cccccaattt tttccacagt gaattagaca ctgtatcttc cacagccacc 
32101 agtcatgggg cagacgtcag ctcagccatt ccaacaaata tctcacctag tgaactagat

32161 gcactgaccc cactggtcac tatttcgggg acagatacta gtacaacatt cccaacactg 
32221 actaagtccc cacatgaaac agagacaaga accacatggc tcactcatcc tgcagagacc
32281 agctcaacta ttcccagaac aatccccaat ttttctcatc atgaatcaga tgccacacct 
32341 tcaatagcca ccagtcctgg ggcagaaacc agttcagcta ttccaattat gactgtctca 
32401 cctggtgcag aagatctggt gacctcacag gtcactagtt ctgggacaga cagaaatatg

32461 actattccaa ctttgactct ttctcctggt gaaccaaaga cgatagcctc attagtcacc 
32521 catcctgaag cacagacaag ttcggccatt ccaacttcaa ctatctcgcc tgctgtatca 
32581 cggttggtga cctcaatggt caccagtttg gcggcaaaga caagtacaac taatcgagct 
32641 ctgacaaact cccctggtga accagctaca acagtttcat tggtcacgca tcctgcacag 
32701 accagcccaa cagttccctg gacaacttcc atttttttcc atagtaaatc agacaccaca

32761 ccttcaatga ccaccagtca tggggcagaa tccagttcag ctgttccaac tccaactgtt 
32821 tcaactgagg taccaggagt agtgacccct ttggtcacca gttctagggc agtgatcagt 
32881 acaactattc caattctgac tctttctcct ggtgaaccag agaccacacc ttcaatggcc 
32941 accagtcatg gggaagaagc cagttctgct attccaactc caactgtttc acctggggta 
33001 ccaggagtgg tgacctctct ggtcactagt tctagggcag tgactagtac aactattcca

33061 attctgactt tttctcttgg tgaaccagag accacacctt caatggccac cagtcatggg 
33121 acagaagctg gctcagctgt tccaactgtt ttacctgagg taccaggaat ggtgacctct 
33181 ctggttgcta gttctagggc agtaaccagt acaactcttc caactctgac tctttctcct
33241 ggtgaaccag agaccacacc ttcaatggcc accagtcatg gggcagaagc cagctcaact 
33301 gttccaactg tttcacctga ggtaccagga gtggtgacct ctctggtcac tagttctagt

33361 ggagtaaaca gtacaagtat tccaactctg attctttctc ctggtgaact agaaaccaca 
33421 ccttcaatgg ccaccagtca tggggcagaa gccagctcag ctgttccaac tccaactgtt 
33481 tcacctgggg tatcaggagt ggtgacccct ctggtcacta gttccagggc agtgaccagt 
33541 acaactattc caattctaac tctttcttct agtgagccag agaccacacc ttcaatggcc 
33601 accagtcatg gggtagaagc cagctcagct gttctaactg tttcacctga ggtaccagga

33661 atggtgacct ctctggtcac tagttctaga gcagtaacca gtacaactat tccaactctg 
33721 actatttctt ctgatgaacc agagaccaca acttcattgg tcacccattc tgaggcaaag 
33781 atgatttcag ccattccaac tttagctgtc tcccctactg tacaagggct ggtgacttca 
33841 ctggtcacta gttctgggtc agagaccagt gcgttttcaa atctaactgt tgcctcaagt 
33901 caaccagaga ccatagactc atgggtcgct catcctggga cagaagcaag ttctgttgtt

33961 ccaactttga ctgtctccac tggtgagccg tttacaaata tctcattggt cacccatcct 
34021 gcagagagta gctcaactct tcccaggaca acctcaaggt tttcccacag tgaattagac 
34081 actatgcctt ctacagtcac cagtcctgag gcagaatcca gctcagccat ttcaactact 
34141 atttcacctg gtataccagg tgtgctgaca tcactggtca ctagctctgg gagagacatc 
34201 agtgcaactt ttccaacagt gcctgagtcc ccacatgaat cagaggcaac agcctcatgg

34261 gttactcatc ctgcagtcac cagcacaaca gttcccagga caacccctaa ttattctcat 
34321 agtgaaccag acaccacacc atcaatagcc accagtcatg gggcagaagc cacttcagat 
34381 tttccaacaa taactgtctc acctgatgta ccagatatgg taacctcaca ggtcactagt 
34441 tctgggacag acaccagtat aactattcca actctgactc tttcttctgg tgagccagag 
34501 accacaacct catttatcac ctattctgag acacacacaa gttcagccat tccaactctc

34561 cctgtctccc ctggtgcatc aaagatgctg acctcactgg tcatcagttc tgggacagac
34621 agcactacaa ctttcccaac actgacggag accccatatg aaccagagac aacagccata 
34681 cagctcattc atcctgcaga gaccaacaca atggttccca agacaactcc caagttttcc
34741 catagtaagt cagacaccac actcccagta gccatcacca gtcctgggcc agaagccagt
34801 tcagctgttt caacgacaac tatctcacct gatatgtcag atctggtgac ctcactggtc

34861 cctagttctg ggacagacac cagtacaacc ttcccaacat tgagtgagac cccatatgaa 
34921 ccagagacta cagtcacgtg gctcactcat cctgcagaaa ccagcacaac ggtttctggg 
34981 acaattccca acttttccca taggggatca gacactgcac cctcaatggt caccagtcct
35041 ggagtagaca cgaggtcagg tgttccaact acaaccatcc cacccagtat accaggggta 
35101 gtgacctcac aggtcactag ttctgcaaca gacactagta cagctattcc aactttgact

35161 ccttctcctg gtgaaccaga gaccacagcc tcatcagcta cccatcctgg gacacagact 
35221 ggcttcactg ttccaattcg gactgttccc tctagtgagc cagatacaat ggcttcctgg 
35281 gtcactcatc ctccacagac cagcacacct gtttccagaa caacctccag tttttcccat 
35341 agtagtccag atgccacacc tgtaatggcc accagtccta ggacagaagc cagttcagct 
35401 gtactgacaa caatctcacc tggtgcacca gagatggtga cttcacagat cactagttct 
35461 ggggcagcaa ccagtacaac tgttccaact ttgactcatt ctcctggtat gccagagacc 
35521 acagccttat tgagcaccca tcccagaaca gggacaagta aaacatttcc tgcttcaact
35581 gtgtttcctc aagtatcaga gaccacagcc tcactcacca ttagacctgg tgcagagact 
35641 agcacagctc tcccaactca gacaacatcc tctctcttca ccctacttgt aactggaacc 
35701 agcagagttg atctaagtcc aactgcttca cctggtgttt ctgcaaaaacj agccccactt 
35761 tccacccatc cagggacaga gaccagcaca atgattccaa cttcaactct ttcccttggt

35821 ttactagaga ctacaggctt actggccacc agctcttcag cagagaccag cacgagtact 
35881 ctaactctga ctgtttcccc tgctgtctct gggctttcca gtgcctctat aacaactgat
35941 aagccccaaa ctgtgacctc ctggaacaca gaaacctcac catctgtaac ttcagttgga 
36001 cccccagaat tttccaggac tgtcacaggc accactatga ccttgatacc atcagagatg 
36061 ccaacaccac ctaaaaccag tcatggagaa ggagtgagtc caaccactat cttgagaact

36121 acaatggttg aagccactaa tttagctacc acaggttcca gtcccactgt ggccaagaca 
36181 acaaccacct tcaatacact ggctggaagc ctctttactc ctctgaccac acctgggatg 
36241 tccaccttgg cctctgagag tgtgacctca agaacaagtt ataaccatcg gtcctggatc 
36301 tccaccacca gcagttataa ccgtcggtac tggacccctg ccaccagcac tccagtgact
36361 tctacattct ccccagggat ttccacatcc tccatcccca gctccacagc agccacagtc

36421 ccattcatgg tgccattcac cctcaacttc accatcacca acctgcagta cgaggaggac 
36481 atgcggcacc ctggttccag gaagttcaac gccacagaga gagaactgca gggtctgctc 
36541 aaacccttgt tcaggaatag cagtctggaa tacctctatt caggctgcag actagcctca 
36601 ctcaggccag agaaggatag ctcagccatg gcagtggatg ccatctgcac acatcgccct 
36661 gaccctgaag acctcggact ggacagagag cgactgtact gggagctgag caatctgaca

36721 aatggcatcc aggagctggg cccctacacc ctggaccgga acagtctcta tgtcaatggt 
36781 ttcacccatc gaagctctat gcccaccacc agcactcctg ggacctccac agtggatgtg 
36841 ggaacctcag ggactccatc ctccagcccc agccccacgg ctgctggccc tctcctgatg 
36901 ccgttcaccc tcaacttcac catcaccaac ctgcagtacg aggaggacat gcgtcgcact 
36961 ggctccagga agttcaacac catggagagt gtcctgcagg gtctgctcaa gcccttgttc

37021 aagaacacca gtgttggccc tctgtactct ggctgcagat tgaccttgct caggcccgag 
37081 aaagatgggg cagccactgg agtggatgcc atctgcaccc accgccttga ccccaaaagc 
37141 cctggactca acagggagca gctgtactgg gagctaagca aactgaccaa tgacattgaa 
37201 gagctgggcc cctacaccct ggacaggaac agtctctatg tcaatggttt cacccatcag 
37261 agctctgtgt ccaccaccag cactcctggg acctccacag tggatctcag aacctcaggg
37321 actccatcct ccctctccag ccccacaatt atggctgctg gccctctcct ggtaccattc 
37381 accctcaact tcaccatcac caacctgcag tatggggagg acatgggtca ccctggctcc 
37441 aggaagttca acaccacaga gagggtcctg cagggtctgc ttggtcccat attcaagaac 
37501 accagtgttg gccctctgta ctctggctgc agactgacct ctctcaggtc tgagaaggat 
37561 ggagcagcca ctggagtgga tgccatctgc atccatcatc ttgaccccaa aagccctgga

37621 ctcaacagag agcggctgta ctgggagctg agccaactga ccaatggcat caaagagctg 
37681 ggcccctaca ccctggacag gaacagtctc tatgtcaatg gtttcaccca tcggacctct 
37741 gtgcccacca ccagcactcc tgggacctcc acagtggacc ttggaacctc agggactcca 
37801 ttctccctcc caagccccgc aactgctggc cctctcctgg tgctgttcac cctcaacttc
37861 accatcacca acctgaagta tgaggaggac atgcatcgcc ctggctccag gaagttcaac

37921 accactgaga gggtcctgca gactctgctt ggtcctatgt tcaagaacac cagtgttggc 
37981 cttctgtact ctggctgcag actgaccttg ctcaggtccg agaaggatgg agcagccact 
38041 ggagtggatg ccatctgcac ccaccgtctt gaccccaaaa gccctggact ggacagagag 
38101 cagctatact gggagctgag ccagctgacc aatggcatca aagagctggg cccctacacc
38161 ctggacagga acagtctcta tgtcaatggt ttcacccatt ggatccctgt gcccaccagc

38221 agcactcctg ggacctccac agtggacctt gggtcaggga ctccatcctc cctccccagc 
38281 cccacagctg ctggccctct cctggtgcca ttcaccctca acttcaccat caccaacctg 
38341 cagtacgagg aggacatgca tcacccaggc tccaggaagt tcaacaccac ggagcgggtc 
38401 ctgcagggtc tgcttggtcc catgttcaag aacaccagtg tcggccttct gtactctggc 
38461 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc

38521 tgcacccacc gtcttgaccc caaaagccct ggagtggaca gggagcagct atactgggag 
38581 ctgagccagc tgaccaatgg catcaaagag ctgggtccct acaccctgga cagaaacagt 
38641 ctctatgtca atggtttcac ccatcagacc tctgcgccca acaccagcac tcctgggacc 
38701 tccacagtgg accttgggac ctcagggact ccatcctccc tccccagccc tacatcngct 
38761 ggccctctcc tggtnccntt caccctcaac ttcaccatca ccaacctgca gtacgaggag

38821 gacatgcggc acccnggntc caggaagttc aacaccacng agagggtnct gcagggtctg 
38881 ctnaagcccc tnttcaagag caccagtgtt ggccctctgt actctggctg cagactgacc 
38941 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 
39001 cttgacccca aaagccctgg agtggacagg gagcagctat actgggagct gagccagctg 
39061 accaatggca tcaaagagct gggtccctac accctggaca gaaacagtct ctatgtcaat

39121 ggtttcaccc atcagacctc tgcgcccaac accagcactc ctgggacctc cacagtggac 
39181 cttgggacct cagggactcc atcctccctc cccagcccta catctgctgg ccctctcctg
39241 gtgccattca ccctcaactt caccatcacc aacctgcagt acgaggagga catgcatcac 
39301 ccaggctcca ggaagttcaa caccacggag cgggtcctgc agggtctgct tggtcccatg 
39361 ttcaagaaca ccagtgtcgg ccttctgtac tctggctgca gactgacctt gctcaggcct 
39421 gagaagaatg gggcagccac tggaatggat gccatctgca gccaccgtct tgaccccaaa

39481 agccctggac tcaacagaga gcagctgtac tgggagctga gccagctgac ccatggcatc 
39541 aaagagctgg gcccctacac cctggacagg aacagtctct atgtcaatgg tttcacccat 
39601 cggagctctg tggcccccac cagcactcct gggacctcca cagtggacct tgggacctca 
39661 gggactccat cctccctccc cagccccaca acagctgttc ctctcctggt gccgttcacc 
39721 ctcaacttta ccatcaccaa tctgcagtat ggggaggaca tgcgtcaccc tggctccagg

39781 aagttcaaca ccacagagag ggtcctgcag ggtctgcttg gtcccttgtt caagaactcc 
39841 agtgtcggcc ctctgtactc tggctgcaga ctgatctctc tcaggtctga gaaggatggg 
39901 gcagccactg gagtggatgc catctgcacc caccacctta accctcaaag ccctggactg 
39961 gacagggagc agctgtactg gcagctgagc cagatgacca atggcatcaa agagctgggc 
40021 ccctacaccc tggaccggaa cagtctctac gtcaatggtt tcacccatcg gagctctggg 
40081 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 
40141 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct caacttcacc 
40201 atcaccaacc tgcagtatga ggaggacatg catcgccctg gatctaggaa gttcaacacc 
40261 acagagaggg tcctgcaggg tctgcttagt cccattttca agaactccag tgttggccct 
40321 ctgtactctg gctgcagact gacctctctc aggcccgaga aggatggggc agcaactgga

40381 atggatgctg tctgcctcta ccaccctaat cccaaaagac ctggactgga cagagagcag 
40441 ctgtactggg agctaagcca gctgacccac aacatcactg agctgggccc ctacagcctg 
40501 gacagggaca gtctctatgt caatggtttc acccatcaga actctgtgcc caccaccagt 
40561 actcctggga cctccacagt gtactgggca accactggga ctccatcctc cttccccggc 
40621 cacacagagc ctggccctct cctgatacca ttcactttca actttaccat caccaacctg

40681 cattatgagg aaaacatgca acaccctggt tccaggaagt tcaacaccac ggagagggtt 
40741 ctgcagggtc tgctcaagcc cttgttcaag aacaccagtg ttggccctct gtactctggc 
40801 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
40861 tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgggag
40921 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt

40981 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc
41041 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 
41101 ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 
41161 aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 
41221 ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc

41281 ttgctcagac ctgagaagca tgaggcagcc actggagtgg acaccatctg tacccaccgc 
41341 gttgatccca tcggacctgg actggacagg gagcggctat actgggagct gagccagctg 
41401 accaacagca ttaccgaact gggaccctac accctggaca gggacagtct ctatgtcaat 
41461 ggcttcaacc ctcggagctc tgtgccaacc accagcactc ctgggacctc cacagtgcac 
41521 ctggcaacct ctgggactcc atcctccctg cctggccaca cagcccctgt ccctctcttg

41581 ataccattca ccctcaactt taccatcacc aacctgcatt atgaggaaaa catgcaacac 
41641 cctggttcca ggaagttcaa caccacggag agggttctgc agggtctgct caagcccttg 
41701 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 
41761 gagaagcatg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc
41821 ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc

41881 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat
41941 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 
42001 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 
42061 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 
42121 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 
42181 agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 
42241 gcagccactg gaatggatgc catctgcagc caccgtcttg accccaaaag ccctggactc 
42301 gacagagagc agctgtactg ggagctgagc cagctgaccc atggcatcaa agagctgggc 
42361 ccctacaccc tggacaggaa cagtctctat gtcaatggtt tcacccatcg gagctctgtg 
42421 gcccccacca gcactcctgg gacctccaca gtggaccttg ggacctcagg gactccatcc 
42481 tccctcccca gccccacaac agctgttcct ctcctggtgc cgttcaccct caactttacc 
42541 atcaccaatc tgcagtatgg ggaggacatg cgtcaccctg gctccaggaa gttcaacacc 
42601 acagagaggg tcctgcaggg tctgcttggt cccttgttca agaactccag tgtcggccct 
42661 ctgtactctg gctgcagact gatctctctc aggtctgaga aggatggggc agccactgga 
42721 gtggatgcca tctgcaccca ccaccttaac cctcaaagcc ctggactgga cagggagcag 
42781 ctgtactggc agctgagcca gatgaccaat ggcatcaaag agctgggccc ctacaccctg 
42841 gaccggaaca gtctctacgt caatggtttc acccatcgga gctctgggct caccaccagc
42901 actccttgga cttccacagt tgaccttgga acctcaggga ctccatcccc cgtccccagc 
42961 cccacaactg ctggccctct cctggtgcca ttcaccctaa acttcaccat caccaacctg 
43021 cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacgccac agagagggtc 
43081 ctgcagggtc tgcttagtcc catattcaag aactccagtg ttggccctct gtactctggc

43141 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
43201 tgcctctacc accctaatcc caaaagacct ggactggaca gagagcagct gtactgggag 
43261 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 
43321 ctctatgtca atggtttcac ccatcagagc tctatgacga ccaccagaac tcctgatacc 
43381 tccacaatgc acctggcaac ctcgagaact ccagcctccc tgtctggacc tacgaccgcc

43441 agccctctcc tggtgctatt cacaatcaac tgcaccatca ccaacctgca gtacgaggag 
43501 gacatgcgtc gcactggctc caggaagttc aacaccatgg agagtgtcct gcagggtctg 
43561 ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagattgacc 
43621 ttgctcaggc ccaagaaaga tggggcagcc actggagtgg atgccatctg cacccaccgc 
43681 cttgacccca aaagccctgg actcaacagg gagcagctgt actgggagct aagcaaactg

43741 accaatgaca ttgaagagct gggcccctac accctggaca ggaacagtct ctatgtcaat 
43801 ggtttcaccc atcagagctc tgtgtccacc accagcactc ctgggacctc cacagtggat 
43861 ctcagaacct cagggactcc atcctccctc tccagcccca caattatgnc nnctgnccct 
43921 ctcctgntnc cnttcaccnt caacttnacc atcaccaacc tgcantangn ggannacatg 
43981 cnncncccng gntccaggaa gttcaacacc acngagaggg tcctacaggg tctgctcagg

44041 cccttgttca agaacaccag tgtcagctct ctgtactctg gttgcagact gaccttgctc 
44101 aggcctgaga aggatggggc agccaccaga gtggatgctg cctgcaccta ccgccctgat 
44161 cccaaaagcc ctggactgga cagagagcaa ctatactggg agctgagcca gctaacccac 
44221 agcatcactg agctgggacc ctacaccctg gacagggtca gtctctatgt caatggcttc 
44281 aaccctcgga gctctgtgcc aaccaccagc actcctggga cctccacagt gcacctggca

44341 acctctggga ctccatcctc cctgcctggc cacacancnn ctgnccctct cctgntnccn 
44401 ttcaccntca acttnaccat caccaacctg cantangngg annacatgcn ncncccnggn 
44461 tccaggaagt tcaacaccac ngagagggtt ctgcagggtc tgctcaaacc cttgttcagg 
44521 aatagcagtc tggaatacct ctattcaggc tgcagactag cctcactcag gccagagaag 
44581 gatagctcag ccatggcagt ggatgccatc tgcacacatc gccctgaccc tgaagacctc 
44641 ggactggaca gagagcgact gtactgggag ctgagcaatc tgacaaatgg catccaggag 
44701 ctgggcccct acaccctgga ccggaacagt ctctacgtca atggtttcac ccatcggagc 
44761 tctgggctca ccaccagcac tccttggact tccacagttg accttggaac ctcagggact 
44821 ccatcccccg tccccagccc cacaactgct ggccctctcc tggtgccatt caccctcaac 
44881 ttcaccatca ccaacctgca gtatgaggag gacatgcatc gccctggttc caggaggttc

44941 aacaccacgg agagggttct gcagggtctg ctcacgccct tgttcaagaa caccagtgtt 
45001 ggccctctgt actctggctg cagactgacc ttgctcagac ctgagaagca agaggcagcc 
45061 actggagtgg acaccatctg tacccaccgc gttgatccca tcggacctgg actggacaga 
45121 gagcggctat actgggagct gagccagctg accaacagca tcacagagct gggaccctac 
45181 accctggata gggacagtct ctatgtcaat ggcttcaacc cttggagctc tgtgccaacc

45241 accagcactc ctgggacctc cacagtgcac ctggcaacct ctgggactcc atcctccctg 
45301 cctggccaca cagcccctgt ccctctcttg ataccattca ccctcaactt taccatcacc 
45361 gacctgcatt atgaagaaaa catgcaacac cctggttcca ggaagttcaa caccacggag 
45421 agggttctgc agggtctgct caagcccttg ttcaagagca ccagcgttgg ccctctgtac 
45481 tctggctgca gactgacctt gctcagacct gagaaacatg gggcagccac tggagtggac

45541 gccatctgca ccctccgcct tgatcccact ggtcctggac tggacagaga gcggctatac
45601 tgggagctga gccagctgac caacagcgtt acagagctgg gcccctacac cctggacagg 
45661 gacagtctct atgtcaatgg cttcacccat cggagctctg tgccaaccac cagtattcct 
45721 gggacctctg cagtgcacct ggaaacctct gggactccag cctccctccc tggccacaca
45781 gcccctggcc ctctcctggt gccattcacc ctcaacttca ctatcaccaa cctgcagtat

45841 gaggaggaca tgcgtcaccc tggttccagg aagttcagca ccacggagag agtcctgcag 
45901 ggtctgctca agcccttgtt caagaacacc agtgtcagct ctctgtactc tggttgcaga 
45961 ctgaccttgc tcaggcctga gaaggatggg gcagccacca gagtggatgc tgtctgcacc
46021 catcgtcctg accccaaaag ccctggactg gacagagagc ggctgtactg gaagctgagc 
46081 cagctgaccc acggcatcac tgagctgggc ccctacaccc tggacaggca cagtctctat

46141 gtcaatggtt tcacccatca gagctctatg acgaccacca gaactcctga tacctccaca 
46201 atgcacctgg caacctcgag aactccagcc tccctgtctg gacctacgac cgccagccct 
46261 ctcctggtgc tattcacaat taacttcacc atcactaacc tgcggtatga ggagaacatg 
46321 catcaccctg gctctagaaa gtttaacacc acggagagag tccttcaggg tctgctcagg 
46381 cctgtgttca agaacaccag tgttggccct ctgtactctg gctgcagact gaccacgctc

46441 aggcccaaga aggatggggc agccaccaaa gtggatgcca tctgcaccta ccgccctgat 
46501 cccaaaagcc ctggactgga cagagagcag ctatactggg agctgagcca gctaacccac
46561 agcatcactg agctgggccc ctacacccag gacagggaca gtctctatgt caatggcttc 
46621 acccatcgga gctctgtgcc aaccaccagt attcctggga cctctgcagt gcacctggaa 
46681 acctctggga ctccagcctc cctccctggc cacacagccc ctggccctct cctggtgcca
46741 ttcaccctca acttcactat caccaacctg cagtatgagg aggacatgcg tcaccctggt

46801 tccaggaagt tcaacaccac ggagagagtc ctgcagggtc tgctcaagcc cttgttcaag 
46861 agcaccagtg ttggccctct gtactctggc tgcagactga ccttgctcag gcctgaaaaa 
46921 cgtggggcag ccaccggcgt ggacaccatc tgcactcacc gccttgaccc tctaaaccca
46981 ggactggaca gagagcagct atactgggag ctgagcaaac tgacccgtgg catcatcgag

47041 ctgggcccct acctcctgga cagaggcagt ctctatgtca atggtttcac ccatcggacc

47101 tctgtgccca ccaccagcac tcctgggacc tccacagtgg accttggaac ctcagggact 
47161 ccattctccc tcccaagccc cgcancnnct gnccctctcc tgntnccntt caccntcaac 
47221 ttnaccatca ccaacctgca ntangnggan nacatgcnnc ncccnggntc caggaagttc 
47281 aacaccacng agagggtcct gcagactctg cttggtccta tgttcaagaa caccagtgtt 

47341 ggccttctgt actctggctg cagactgacc ttgctcaggt ccgagaagga tggagcagcc

47401 actggagtgg atgccatctg cacccaccgt cttgacccca aaagccctgg agtggacagg 
47461 gagcaactat actgggagct gagccagctg accaatggca ttaaagaact gggcccctac 
47521 accctggaca ggaacagtct ctatgtcaat gggttcaccc attggatccc tgtgcccacc 
47581 agcagcactc ctgggacctc cacagtggac cttgggtcag ggactccatc ctccctcccc 

47641 agccccacaa ctgctggccc tctcctggtg ccgttcaccc tcaacttcac catcaccaac

47701 ctgaagtacg aggaggacat gcattgccct ggctccagga agttcaacac cacagagaga 
47761 gtcctgcaga gtctgcttgg tcccatgttc aagaacacca gtgttggccc tctgtactct 
47821 ggctgcagac tgaccttgct caggtccgag aaggatggag cagccactgg agtggatgcc 
47881 atctgcaccc accgtcttga ccccaaaagc cctggagtgg acagggagca gctatactgg 

47941 gagctgagcc agctgaccaa tggcatcaaa gagctgggtc cctacaccct ggacagaaac

48001 agtctctatg tcaatggttt cacccatcag acctctgcgc ccaacaccag cactcctggg 
48061 acctccacag tggaccttgg gacctcaggg actccatcct ccctccccag ccctacancn 
48121 nctgnccctc tcctgntncc nttcaccntc aacttnacca tcaccaacct gcantangng
48181 gannacatgc nncncccngg ntccaggaag ttcaacacca cngagngngt nctgcagggt 

48241 ctgctnnnnc ccntnttcaa gaacnccagt gtnggccntc tgtactctgg ctgcagactg

48301 acctnnctca ggncngagaa gnatggngca gccactggan tggatgccat ctgcanccac
48361 cnncntnanc ccaaaagncc tggactgnac agngagcngc tntactggga gctnagccan 
48421 ctgaccaann ncatcnnnga gctgggnccc tacaccctgg acaggnacag tctctatgtc
48481 aatggtttca cccattggat ccctgtgccc accagcagca ctcctgggac ctccacagtg 

48541 gaccttgggt cagggactcc atcctccctc cccagcccca caactgctgg ccctctcctg

48601 gtgccgttca ccctcaactt caccatcacc aacctgaagt acgaggagga catgcattgc
48661 cctggctcca ggaagttcaa caccacagag agagtcctgc agagtctgct tggtcccatg
48721 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctc gctcaggtcc
48781 gagaaggatg gagcagccac tggagtggat gccatctgca cccaccgtgt tgaccccaaa

48841 agccctggag tggacaggga gcagctatac tgggagctga gccagctgac caatggcatc

48901 aaagagctgg gtccctacac cctggacaga aacagtctct atgtcaatgg tttcacccat 
48961 cagacctctg cgcccaacac cagcactcct gggacctcca cagtgnacnt nggnacctcn 
49021 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc
49081 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 

49141 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc

49201 agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 
49261 gcaaccactg gaatggatgc catctgcacc caccgtcttg accccaaaag ccctggactg 
49321 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn
49381 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 

49441 cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc

49501 tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 
49561 atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc
49621 acngagaggg ttctgcaggg tctgctcaaa cccttgttca ggaatagcag tctggaatac 
49681 ctctattcag gctgcagact agcctcactc aggccagaga aggatagctc agccatggca 

49741 gtggatgcca tctgcacaca tcgccctgac cctgaagacc tcggactgga cagagagcga

49801 ctgtactggg agctgagcaa tctgacaaat ggcatccagg agctgggccc ctacaccctg 
49861 gaccggaaca gtctctatgt caatggtttc acccatcgaa gctctatgcc caccaccagc 
49921 actcctggga cctccacagt ggatgtggga acctcaggga ctccatcctc cagccccagc 
49981 cccacgactg ctggccctct cctgatacca ttcaccctca acttcaccat caccaacctg 

50041 cagtatgggg aggacatggg tcaccctggc tccaggaagt tcaacaccac agagagggtc

50101 ctgcagggtc tgcttggtcc catattcaag aacaccagtg ttggccctct gtactctggc 
50161 tgcagactga cctctctcag gtctgagaag gatggagcag ccactggagt ggatgccatc
50221 tgcatccatc atcttgaccc caaaagccct ggactcaaca gagagcggct gtactgggag 
50281 ctgagccaac tgaccaatgg catcaaagag ctgggcccct acaccctgga caggaacagt 
50341 ctctatgtca atggtttcac ccatcggacc tctgtgccca ccaccagcac tcctgggacc 
50401 tccacagtgg accttggaac ctcagggact ccattctccc tcccaagccc cgcaactgct

50461 ggccctctcc tggtgctgtt caccctcaac ttcaccatca ccaacctgaa gtatgaggag 
50521 gacatgcatc gccctggctc caggaagttc aacaccactg agagggtcct gcagactctg 
50581 cttggtccta tgttcaagaa caccagtgtt ggccttctgt actctggctg cagactgacc 
50641 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 

50701 cttgacccca aaagccctgg actgnacagn gagcngctnt actgggagct nagccanctg

50761 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat
50821 ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 
50881 ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 
50941 ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc

51001 ccnggntcca ggaagttcaa caccacngag agagtccttc agggtctgct caggcctgtg

51061 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggccc 
51121 aagaaggatg gggcagccac caaagtggat gccatctgca cctaccgccc tgatcccaaa 
51181 agccctggac tggacagaga gcagctatac tgggagctga gccagctaac ccacagcatc 
51241 actgagctgg gcccctacac ccaggacagg gacagtctct atgtcaatgg cttcacccat 

51301 cggagctctg tgccaaccac cagtattcct gggacctctg cagtgcacct ggaaaccact

51361 gggactccat cctccttccc cggccacaca gagcctggcc ctctcctgat accattcact 
51421 ttcaacttta ccatcaccaa cctgcgttat gaggaaaaca tgcaacaccc tggttccagg 
51481 aagttcaaca ccacggagag ggttctgcag ggtctgctca cgcccttgtt caagaacacc 
51541 agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaagcaggag 

51601 gcagccactg gagtggacac catctgtacc caccgcgttg atcccatcgg acctggactg

51661 gacagagagc ggctatactg ggagctgagc cagctgacca acagcatcac agagctggga 
51721 ccctacaccc tggataggga cagtctctat gtcgatggct tcaacccttg gagctctgtg 
51781 ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 
51841 cccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 

51901 atcaccgacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc

51961 acggagaggg ttctgcaggg tctgctcaag cccttgttca agagcaccag cgttggccct 
52021 ctgtactctg gctgcagact gaccttgctc agacctgaga aacatggggc agccactgga 
52081 gtggacgcca tctgcaccct ccgccttgat cccactggtc ctggactgga cagagagcgg 
52141 ctatactggg agctgagcca gctgaccaac agcatcacag agctgggacc ctacaccctg 

52201 gatagggaca gtctctatgt caatggcttc aacccttgga gctctgtgcc aaccaccagc

52261 actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc
52321 cacacaactg ctggccctct cctggtgccg ttcaccctca acttcaccat caccaacctg 
52381 aagtacgagg aggacatgca ttgccctggc tccaggaagt tcaacaccac agagagagtc 
52441 ctgcagagtc tgcatggtcc catgttcaag aacaccagtg ttggccctct gtactctggc 

52501 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc

52561 tgcacccacc gtcttgaccc caaaagccct ggactgnaca gngagcngct ntactgggag 
52621 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 
52681 ctctatgtca atggtttcac ccatcnganc tctgngccca ccaccagcac tcctgggacc 
52741 tccacagtgn acntnggnac ctcngggact ccatcctccn tccccngccn cacancnnct 

52801 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan

52861 nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 
52921 ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc
52981 tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 
53041 cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 

53101 accaacagca tcacagagct gggaccctac accctggata gggacagtct ctatgtcaat

53161 ggtttcaccc atcgaagctc tatgcccacc accagtattc ctgggacctc tgcagtgcac 
53221 ctggaaacct ctgggactcc agcctccctc cctggccaca cagcccctgg ccctctcctg 
53281 gtgccattca ccctcaactt cactatcacc aacctgcagt atgaggagga catgcgtcac 
53341 cctggttcca ggaagttcaa caccacggag agagtcctgc agggtctgct caagcccttg 

53401 ttcaagagca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggcct

53461 gaaaaacgtg gggcagccac cggcgtggac accatctgca ctcaccgcct tgaccctcta 
53521 aaccctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 
53581 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
53641 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

53701 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc

53761 ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 
53821 aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc
53881 agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn
53941 gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 
54001 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn
54061 ccctacaccc tggacaggna cagtctctat gtcaatggtt ttcaccctcg gagctctgtg

54121 ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 
54181 tccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 
54241 atcaccaacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 
54301 acggagcggg tcctgcaggg tctgcttggt cccatgttca agaacacaag tgtcggcctt 

54361 ctgtactctg gctgcagact gaccttgctc aggcctgaga agaatggggc agccactgga

54421 atggatgcca tctgcagcca ccgtcttgac cccaaaagcc ctggactgna cagngagcng 
54481 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg
54541 gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 
54601 actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 

54661 cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg

54721 cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 
54781 ctgcagggtc tgctnnnncc cntnttcaag aacnccagtg tnggccntct gtactctggc
54841 tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc
54901 tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag

54961 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt

55021 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 
55081 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 
55141 ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 
55201 aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 

55261 ctcacgccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc

55321 ttgctcagac ctgagaagca ggaggcagcc actggagtgg acaccatctg tacccaccgc 
55381 gttgatccca tcggacctgg actgnacagn gagcngctnt actgggagct nagccanctg 
55441 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 
55501 ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 

55561 ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg

55621 ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 
55681 ccnggntcca ggaagttcaa caccacngag ngngtnctgc agggtctgct nnnncccntn
55741 ttcaagaacn ccagtgtngg ccntctgtac tctggctgca gactgacctn nctcaggncn 
55801 gagaagnatg gngcagccac tggantggat gccatctgca nccaccnncn tnancccaaa

55861 agncctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc

55921 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
55981 cggagctctg tgccaaccac cagcagtcct gggacctcca cagtgcacct ggcaacctct 
56041 gggactccat cctccctgcc tggccacaca gcccctgtcc ctctcttgat accattcacc 
56101 ctcaacttta ccatcaccaa cctgcattat gaagaaaaca tgcaacaccc tggttccagg 

56161 aagttcaaca ccacggagag ggttctgcag ggtctgctca agcccttgtt caagagcacc

56221 agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaaacatggg 
56281 gcagccactg gagtggacgc catctgcacc ctccgccttg atcccactgg tcctggactg 
56341 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 
56401 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 

56461 cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc

56521 tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 
56581 atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc
56641 acngagngng tnctgcaggg tctgctnnnn cccntnttca agaacnccag tgtnggccnt 
56701 ctgtactctg gctgcagact gacctnnctc aggncngaga agnatggngc agccactgga

56761 ntggatgcca tctgcancca ccnncntnan cccaaaagnc ctggactgna cagngagcng

56821 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
56881 gacaggnaca gtctctatgt caatggtttc acccatcgga cctctgtgcc caccaccagc 
56941 actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc 
57001 cacacagccc ctgtccctct cttgatacca ttcaccctca actttaccat caccaacctg 

57061 cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacaccac agagagggtc

57121 ctgcagggtc tgcttagtcc cattttcaag aactccagtg ttggccctct gtactctggc 
57181 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
57241 tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgcgag 
57301 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 

57361 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc

57421 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacancnnct 
57481 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan
57541 nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg
57601 ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc
57661 tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 
57721 cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg

57781 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat
57841 ggtttcaccc attggagctc tgggctcacc accagcactc cttggacttc cacagttgac 
57901 cttggaacct cagggactcc atcccccgtc cccagcccca caactgctgg ccctctcctg 
57961 gtgccattca ccctaaactt caccatcacc aacctgcagt atgaggagga catgcatcgc 
58021 cctggatcta ggaagttcaa cgccacagag agggtcctgc agggtctgct tagtcccata

58081 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 
58141 gagaagcagg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 
58201 ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 
58261 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat
58321 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 
58381 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc
58441 ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 
58501 aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc
58561 agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn 
58621 gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 
58681 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 
58741 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcg gagctttggg 
58801 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc
58861 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct aaacttcacc 
58921 atcaccaacc tgcagtatga ggaggacatg catcgccctg gctccaggaa gttcaacacc

58981 acggagaggg tccttcaggg tctgcttacg cccttgttca ggaacaccag tgtcagctct 
59041 ctgtactctg gttgcagact gaccttgctc aggcctgaga aggatggggc agccaccaga 
59101 gtggatgctg tctgcaccca tcgtcctgac cccaaaagcc ctggactgna cagngagcng 
59161 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
59221 gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc

59281 actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 
59341 cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg 
59401 cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 
59461 ctgcagggtc tgctnnnncc cntnttcaag aacnccagtg tnggccntct gtactctggc 
59521 tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc

59581 tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag 
59641 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 
59701 ctctatgtca atggtttcac ccattggatc cctgtgccca ccagcagcac tcctgggacc 
59761 tccacagtgg accttgggtc agggactcca tcctccctcc ccagccccac aactgctggc 
59821 cctctcctgg taccattcac cctcaacttc accatcacca acctgcagta tggggaggac

59881 atgggtcacc ctggctccag gaagttcaac accacagaga gggtcctgca gggtctgctt 
59941 ggtcccatat tcaagaacac cagtgttggc cctctgtact ctggctgcag actgacctct 
60001 ctcaggtccg agaaggatgg agcagccact ggagtggatg ccatctgcat ccatcatctt 
60061 gaccccaaaa gccctggact gnacagngag cngctntact gggagctnag ccanctgacc 
60121 aannncatcn nngagctggg nccctacacc ctggacaggn acagtctcta tgtcaatggt

60181 ttcacccatc nganctctgn gcccaccacc agcactcctg ggacctccac agtgnacntn
60241 ggnacctcng ggactccatc ctccntcccc ngccncacan cnnctgnccc tctcctgntn
60301 ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn
60361 ggntccagga agttcaacac cacngagngn gtnctgcagg gtctgctnnn ncccntnttc 
60421 aagaacncca gtgtnggccn tctgtactct ggctgcagac tgacctnnct caggncngag

60481 aagnatggng cagccactgg antggatgcc atctgcancc accnncntna ncccaaaagn 
60541 cctggactgn acagngagcn gctntactgg gagctnagcc anctgaccaa nnncatcnnn
60601 gagctgggac cctacaccct ggacaggnac agtctctatg tcaatggttt cacccatcag 
60661 acctttgcgc ccaacaccag cactcctggg acctccacag tggaccttgg gacctcaggg 
60721 actccatcct ccctccccag ccctacatct gctggccctc tcctggtgcc attcaccctc

60781 aacttcacca tcaccaacct gcagtacgag gaggacatgc atcacccagg ctccaggaag 
60841 ttcaacacca cggagcgggt cctgcagggt ctgcttggtc ccatgttcaa gaacaccagt 
60901 gtcggccttc tgtactctgg ctgcagactg accttgctca ggcctgagaa gaatggggca 
60961 gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactgnac 
61021 agngagcngc tntactggga gctnagccan ctgaccaann ncatcnnnga gctgggnccc

61081 tacaccctgg acaggnacag tctctatgtc aatggtttca cccatcngan ctctgngccc 
61141 accaccagca ctcctgggac ctccacagtg nacntnggna cctcngggac tccatcctcc
61201 ntccccngcc ncacagcccc tgtccctctc ttgataccat tcaccctcaa ctttaccatc
61261 accaacctgc attatgaaga aaacatgcaa caccctggtt ccaggaagtt caacaccacg
61321 gagagggttc tgcagggtct gctcaagccc ttgttcaaga gcaccagcgt tggccctctg 
61381 tactctggct gcagactgac cttgctcaga cctgagaaac atggggcagc cactggagtg

61441 gacgccatct gcaccctccg ccttgatccc actggtcctg gactggacag agagcggcta 
61501 tactgggagc tgagccagct gaccaacagc gttacagagc tgggccccta caccctggac 
61561 agggacagtc tctatgtcaa tggcttcacc cagcggagct ctgtgccaac caccagtatt
61621 cctgggacct ctgcagtgca cctggaaacc tctgggactc cagcctccct ccctggccac 

61681 acagcccctg gccctctcct ggtgccattc accctcaact tcactatcac caacctgcag

61741 tatgaggtgg acatgcgtca ccctggttcc aggaagttca acaccacgga gagagtcctg 
61801 cagggtctgc tcaagccctt gttcaagagc accagtgttg gccctctgta ctctggctgc 
61861 agactgacct tgctcaggcc tgaaaaacgt ggggcagcca ccggcgtgga caccatctgc 
61921 actcaccgcc ttgaccctct aaaccctgga ctggacagag agcagctata ctgggagctg 

61981 agcaaactga cccgtggcat catcgagctg ggcccctacc tcctggacag aggcagtctc

62041 tatgtcaatg gtttcaccca tcggaacttt gtgcccatca ccagcactcc tgggacctcc 
62101 acagtacacc taggaacctc tgaaactcca tcctccctac ctagacccat agtgcctggc 
62161 cctctcctgg tgccattcac cctcaacttc accatcacca acttgcagta tgaggaggcc 
62221 atgcgacacc ctggctccag gaagttcaat accacggaga gggtcctaca gggtctgctc 

62281 aggcccttgt tcaagaatac cagtatcggc cctctgtact ccagctgcag actgaccttg

62341 ctcaggccag agaaggacaa ggcagccacc agagtggatg ccatctgtac ccaccaccct 
62401 gaccctcaaa gccctggact gaacagagag cagctgtact gggagctgag ccagctgacc 
62461 cacggcatca ctgagctggg cccctacacc ctggacaggg acagtctcta tgtcgatggt 
62521 ttcactcatt ggagccccat accgaccacc agcactcctg ggacctccat agtgaacctg 

62581 ggaacctctg ggatcccacc ttccctccct gaaactacan cnnctgnccc tctcctgntn

62641 ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn
62701 ggntccagga agttcaacac cacngagagg gttctgcagg gtctgctcaa gcccttgttc 
62761 aagagcacca gtgttggccc tctgtattct ggctgcagac tgaccttgct caggcctgag 
62821 aaggacggag tagccaccag agtggacgcc atctgcaccc accgccctga ccccaaaatc 

62881 cctgggctag acagacagca gctatactgg gagctgagcc agctgaccca cagcatcact

62941 gagctgggac cctacaccct ggatagggac agtctctatg tcaatggttt cacccagcgg 
63001 agctctgtgc ccaccaccag cactcctggg actttcacag tacagccgga aacctctgag 
63061 actccatcat ccctccctgg ccccacagcc actggccctg tcctgctgcc attcaccctc 
63121 aattttacca tcactaacct gcagtatgag gaggacatgc atcgccctgg ctccaggaag 

63181 ttcaacacca cggagagggt ccttcagggt ctgcttatgc ccttgttcaa gaacaccagt

63241 gtcagctctc tgtactctgg ttgcagactg accttgctca ggcctgagaa ggatggggca 
63301 gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactggac 
63361 agagagcggc tgtactggaa gctgagccag ctgacccacg gcatcactga gctgggcccc 
63421 tacaccctgg acaggcacag tctctatgtc aatggtttca cccatcagag ctctatgacg 

63481 accaccagaa ctcctgatac ctccacaatg cacctggcaa cctcgagaac tccagcctcc

63541 ctgtctggac ctacgaccgc cagccctctc ctggtgctat tcacaattaa cttcaccatc 
63601 actaacctgc ggtatgagga gaacatgcat caccctggct ctagaaagtt taacaccacg 
63661 gagagagtcc ttcagggtct gctcaggcct gtgttcaaga acaccagtgt tggccctctg 
63721 tactctggct gcagactgac cttgctcagg cccaagaagg atggggcagc caccaaagtg 

63781 gatgccatct gcacctaccg ccctgatccc aaaagccctg gactggacag agagcagcta

63841 tactgggagc tgagccagct aacccacagc atcactgagc tgggccccta caccctggac 
63901 agggacagtc tctatgtcaa tggtttcaca cagcggagct ctgtgcccac cactagcatt 
63961 cctgggaccc ccacagtgga cctgggaaca tctgggactc cagtttctaa acctggtccc 
64021 tcggctgcca gccctctcct ggtgctattc actctcaact tcaccatcac caacctgcgg 

64081 tatgaggaga acatgcagca ccctggctcc aggaagttca acaccacgga gagggtcctt

64141 cagggcctgc tcaggtccct gttcaagagc accagtgttg gccctctgta ctctggctgc 
64201 agactgactt tgctcaggcc tgaaaaggat gggacagcca ctggagtgga tgccatctgc 
64261 acccaccacc ctgaccccaa aagccctagg ctggacagag agcagctgta ttgggagctg 
64321 agccagctga cccacaatat cactgagctg ggccactatg ccctggacaa cgacagcctc 

64381 tttgtcaatg gtttcactca tcggagctct gtgtccacca ccagcactcc tgggaccccc

64441 acagtgtatc tgggagcatc taagactcca gcctcgatat ttggcccttc agctgccagc 
64501 catctcctga tactattcac cctcaacttc accatcacta acctgcggta tgaggagaac 
64561 atgtggcctg gctccaggaa gttcaacact acagagaggg tccttcaggg cctgctaagg 
64621 cccttgttca agaacaccag tgttggccct ctgtactctg gctccaggct gaccttgctc
64681 aggccagaga aagatgggga agccaccgga gtggatgcca tctgcaccca ccgccctgac
64741 cccacaggcc ctgggctgga cagagagcag ctgtatttgg agctgagcca gctgacccac
64801 agcatcactg agctgggccc ctacacactg gacagggaca gtctctatgt caatggtttc
64861 acccatcgga gctctgtacc caccaccagc accggggtgg tcagcgagga gccattcaca 
64921 ctgaacttca ccatcaacaa cctgcgctac atggcggaca tgggccaacc cggctccctc 
64981 aagttcaaca tcacagacaa cgtcatgaag cacctgctca gtcctttgtt ccagaggagc
65041 agcctgggtg cacggtacac aggctgcagg gtcatcgcac taaggtctgt gaagaacggt
65101 gctgagacac gggtggacct cctctgcacc tacctgcagc ccctcagcgg cccaggtctg 
65161 cctatcaagc aggtgttcca tgagctgagc cagcagaccc atggcatcac ccggctgggc 
65221 ccctactctc tggacaaaga cagcctctac cttaacggtt acaatgeiacc tggtctagat 
65281 gagcctccta caactcccaa gccagccacc acattcctgc ctcctctgtc agaagccaca
65341 acagccatgg ggtaccacct gaagaccctc acactcaact tcaccatctc caatctccag
65401 tattcaccag atatgggcaa gggctcagct acattcaact ccaccgaggg ggtccttcag 
65461 cacctgctca gacccttgtt ccagaa^agc agcatgggcc ccttctactt gggttgccaa 
65521 ctgatctccc tcaggcctga gaaggatggg gcagccactg gtgtggacac cacctgcacc 
65581 taccaccctg accctgtggg ccccgggctg gacatacagc agctttactg ggagctgagt
65641 cagctgaccc atggtgtcac ccaactgggc ttctatgtcc tggacaggga tagcctcttc
65701 atcaatggct atgcacccca gaatttatca atccggggcg agtaccagat aaatttccac 
65761 attgtcaact ggaacctcag taatccagac cccacatcct cagagtacat caccctgctg 
65821 agggacatcc aggacaaggt caccacactc tacaaaggca gtcaactaca tgacacattc 
65881 cgcttctgcc tggtcaccaa cttgacgatg gactccgtgt tggtcactgt caaggcattg
65941 ttctcctcca atttggaccc cagcctggtg gagcaagtct ttctagataa gaccctgaat
66001 gcctcattcc attggctggg ctccacctac cagttggtgg acatccatgt gacagaaatg 
66061 gagtcatcag tttatcaacc aacaagcagc tccagcaccc agcacttcta cctgaatttc 
66121 accatcacca acctaccata ttcccaggac aaagcccagc caggcaccac caattaccag 
66181 aggaacaaaa ggaatattga ggatgcgctc aaccaactct tccgaaacag cagcatcaag
66241 agttattttt ctgactgtca agtttcaaca ttcaggtctg tccccaacag gcaccacacc
66301 ggggtggact ccctgtgtaa cttctcgcca ctggctcgga gagtagacag agttgccatc 
66361 tatgaggaat ttctgcggat gacccggaat ggtacccagc tgcagaactt caccctggac 
66421 aggagcagtg tccttgtgga tgggtattct cccaacagaa atgagccctt aactgggaat 
66481 tctgaccttc ccttctgggc tgtcatcctc atcggcttgg caggactcct gggactcatc
66541 acatgcctga tctgcggtgt cctggtgacc acccgccggc ggaagaagga aggagaatac
66601 aacgtccagc aacagtgccc aggctactac cagtcacacc tagacctgga ggatctgcaa 
66661 tgactggaac ttgccggtgc ctggggtgcc tttcccccag ccagggtcca aagaagcttg 
66721 gctggggcag aaataaacca tattggtcgg aaaaaaaaaa aaaaa 

SEQ ID NO. 3

hK5 amino acid

MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDSSSRIINGSD
CDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPVYESGQQMFQGVKSIPHPG
YSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLSQKR
CEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPVVCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQET
IQANS



SEQ ID NO. 4
KLK5 CDS 

ggtgtctgtg cgtcctgcac ccacatcttt ctctgtcccc tccttgccct gtctggaggc
tgctagactc ctatcttctg aattctatag tgcctgggtc tcagcgcagt gccgatggtg
gcccgtcctt gtggttcctc tctacttggg gaaatcaggt gcagcggcca tggctacagc
aagacccccc tggatgtggg tgctctgtgc tctgatcaca gccttgcttc tgggggtcac
agagcatgtt ctcgccaaca atgatgtttc ctgtgaccac ccctctaaca ccgtgccctc
tgggagcaac caggacctgg gagctggggc cggggaagac gcccggtcgg atgacagcag
cagccgcatc atcaatggat ccgactgcga tatgcacacc cagccgtggc aggccgcgct
gttgctaagg cccaaccagc tctactgcgg ggcggtgttg gtgcatccac agtggctgct
cacggccgcc cactgcagga agaaagtttt cagagtccgt ctcggccact actccctgtc
accagtttat gaatctgggc agcagatgtt ccagggggtc aaatccatcc cccaccctgg
ctactcccac cctggccact ctaacgacct catgctcatc aaactgaaca gaagaattcg
tcccactaaa gatgtcagac ccatcaacgt ctcctctcat tgtccctctg ctgggacaaa
gtgcttggtg tctggctggg ggacaaccaa gagcccccaa gtgcacttcc ctaaggtcct
ccagtgcttg aatatcagcg tgctaagtca gaaaaggtgc gaggatgctt acccgagaca
gatagatgac accatgttct gcgccggtga caaagcaggt agagactcct gccagggtga
ttctgggggg cctgtggtct gcaatggctc cctgcaggga ctcgtgtcct ggggagatta
cccttgtgcc cggcccaaca gaccgggtgt ctacacgaac ctctgcaagt tcaccaagtg
gatccaggaa accatccagg ccaactcctg agtcatccca ggactcagca caccggcatc
cccacctgct gcagggacag ccctgacact cctttcagac cctcattcct tcccagagat
gttgagaatg ttcatctctc cagcccctga ccccatgtct cctggactca gggtctgctt
cccccacatt gggctgaccg tgtctctcta gttgaaccct gggaacaatt tccaaaactg
tccagggcgg gggttgcgtc tcaatctccc tggggcactt tcatcctcaa gctcagggcc
catcccttct ctgcagctct gacccaaatt tagtcccaga aataaactga gaagtggaaa
aaaaaaa



SEQ ID NO. 5

KLK5 nucleic acid 

gggcccagag tgaaggcaag agaaggagtt gagagctccc tctgcaaagt ggcttgagtc
tcccctgcct aaaatgcagg gagagggagg cagaaagaca gggaagagga aggggtgggg
aagaaagaga gagagagaga gagacagaat aacacaacta cagaaacaca gagagaacac
acagagagcc tgggacacag ggacacacag agtcagagag aaaagagaag atagagaaag
acacaaatgg agacacagag gtgtaaagaa agagagatta acagagtccc agatacacgc
aaaggggcag aagcacagtt ttcagggtgg tgtctatgat catcttcttt tttttttttt
tttttttttt tttttgagac ggagtctcgc tctgtcgccc aggctggagt gcagtggcgg
gatctcggct cactgcaagc tccgcctccc gggttcacgc cattctcctg cctcagcctc
ccaagtagct gggactacag gcgcccgcca ctacgcccgg ctaatttttt tgtattttta
gtagagacgg ggtttcaccg ttttagccgg gatggcctcg atctcctgac ctcgtgatcc
gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccgcg cccggccatg
atcatcttct tgactatgct gatgtgacaa gtacctaaag ccatcagact ctacccttta
aatatgcagt ttgggccagg caccgtggct catgcctgta attccagcac tttgggaggc
agaggtgggt gaatcacttg aggccaggag tttgagacca gcctggccaa catggtgaaa
ctctgtcttt actaaaaaaa aaaaaaaaaa aaaaaaaatc agccgggtgt cgtggggcac
acctgtaatc ccagctatgc tggaggctga ggcacgagag tcacttgaac cctggaggcg
gaggttgcag tgggccgaga tcacatcacc gccctccagc ctgggcgaca gagcaagact
ctgtctcaaa taaataaata aacaaacgaa caagcagttt gttgtacctt agttatatct
aaaaaaaaaa tgctgtcaac aaatagagca gaagtgaaat aaaggaaaat aaatgggcca
agaactctaa ggtatatttg acaaatcatt cagaaccttt aaaaaagaaa gaatcacaga
ggcatagaaa gacagggagg aacagggaga cagaaacacc tgtggcccaa ggagaacaaa
acaaggctcc taagacagac aggaggagag agagagagag tgagtgagag acagacagag
aaaaagacag agagagagag acagagacag agagacagag aggcgagagg gatagaaaga
gagagagggg tggagagaga cacgagatat tgagagagac tcagaaagat agccgaggga
gaaccacaga gagatggaag aagactctga gaaaaaacca gagacaaaga tggaaagagg
agtatcgagg gtgaacagac agtggtggaa tgagcaaaat gcagagaaga aagcaagcaa
tccaggcgcc aagaatagtg acccagagtt ggtgagaagc cagatcctta aggctggggg
aggcagggaa ggggctggcc tggcttccgg agacccctcc ccattctccg ggccagggag
gtagggagtg acattccgga ctgggtgggg ggtgctctgg gggtggagat agggggagca
ggaggagcta ttgctaaggc ccgataggca cctcattgcc cgggaatgtg ccccagggag
cagtgggtgg ttataactca ggcccggtgc ccagagccca ggaggaggca gtggccagga
aggcacaggc ctgagaagtc tgcggctgag ctgggagcaa atcccccacc ccctacctgg
gggacagggc aagtgagacc tggtgagggt ggctcagcag gcagggaagg agaggtgtct
gtgcgtcctg cacccacatc tttctctgtc ccctccttgc cctgtctgga ggctgctaga
ctcctatctt ctgaattcta tagtgcctgg gtctcagcgc agtgccgatg gtggcccgtc
cttgtggttc ctctctacct ggggaaataa ggtaggggag ggaggggaag tgggttaagg
gctccccgga tcgcctgggc ctcccaaccc tctgacattc cccatccagg tgcagcggcc
atggctacag caagaccccc ctggatgtgg gtgctctgtg ctctgatcac agccttgctt
ctgggggtca caggtaacca gaactctggg gtgggagggt tgtgggattg ggaggactgt
ctctgcggca ctagagcgcc tgtcccctgg ggaactgtgt gagcctgggc atgactccgg
gaccgggtga atgtgagtct ctgtctgtac ttgtggttgt gcgatcgtat gtggccctgt
gactgccacg gtgtgtgtcg gggaggggga tgccttttcc catatcaggt gactgtgcgg
caggtggcac tgaccctttg aggctgtgtg tgtggttttg tgattgtgtg tgcatttaag
attgtgtgtg gctccacagc tgtgtgggtg aatgcatgta gcactggggg tgttcactgt
gtgtttggct gtgtgtggtg acttggcatt gtatatgact gcaggtatct gcagttcctg
tccctgaggt cccgggattg cgtgcaacaa aagtggtcat caccatggaa agctgtgact
gtgtgctgct tgcaggcgat tatgtgattg tggctgagtg tgacgttatg gatgcccgta
tttgtgaccg tgtgactacc tgaagctctg tgtaggggtg actgtatgtg actgtgtgtg
tctgtgtgag gccgtgtaaa tgctactgta tgtgtgatgg tgcagctgtg tgtctggagt
ttctgtctct gcctggaggg atagagggtg caggggtagc tatctctggg agatgggtgc
caggtgactg acttgcagtg tgtgcctgtg tgcagaagag tatgtggcag tctgaacatc
tgtgcacaca cggcatctgt gcgtggcact gagacactgt ggatgagggt gtgcgatccc
gctaggctgc ccgggagcgt gtgtacctgg agacagagct gtatgttagc tgcacctgtg
gaggcaacat gggcgtgtct gcagaactgc gtgcgtgctt ggctgttact gctgttgtgc
gcgtggttct tggggtgagt tcgtgaatga tggtggtgcc agggccatca gcaagggtaa
gaaccaggcc gggcgcggtg gctcacgcct gtaatcccag ccctttggga ggccgaggca
ggcggatcac ctgaggtcgg gagatcgagg ccagcctgac caacatggag aaccccgtct
ctactaaaaa tacaaaaaat tagctggtgt ggtggcgcgt gcctgtaatc ccagctactc
gggagactgg ggcagaaaaa tcgcttgaac ccgggaggtg gaggttgcgg tgagccgaga
tcgcgccatt gcactccagc ctgggcaaca agagcgaaac tccgtctcga aagaaaaaaa
gaaaaaaaaa agggtaagaa ccagtgaatg ggcacgggag gactgatgat ggagtggggc
atgcatgtag tctgtaggtc tgtgtgtgag aggaggagat tgacaggatt gagaaggcat
gttttcatct gagaattcag aaacctaggc ctgctcttcc cctccatgtg gccccctaag
ctgagccctt ctttcctggt cctgctttcg gaaccctagc tccgcccatg agctctgacc
ccacctcctt tcctcaacca cgcccctagg ccagactcta gtggaccccg cctaaggcca
cacccctttg ggccaggctc caccccctat tctgtgggta ccttctagaa cccccttcaa
agtcagagct tttttttttt tttttttgga gacagtcttg ctctctctcc caggctggag
tgcagtggcg tgatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcgt
gcctccacct cctgagtagc tgggattaca ggtgcgcgcc accacgcctg gctaattttt
gtgtctttag tagagacagg gtttcacctt gttggccagg ctggtctcaa actcccaacc
tcaggtgatc cgcccacctc ggcctcccag agtgctgggg ttacaggcgt gagccaccgc
ccctagccca aagtcagagc tctttatagg agactctaac atgtaaccct gaccctggcc
ctaactaagt caattccaaa ccccttcctg cctccagccc tgaccccact cactgaggcc
tgaccccact tcttgagacc agttccatcc ctaaagccct ggtctccctc ccatccccag
gctccagccc ccacagcttt ggcactaccc ctgagcttgt ccaggaatcc tgtacccaat
tttaccctca catgtagttc tagccaattc caggaatctg tgaggtccag ttagagtcca
gtaaccctac ctgagcctgg gctctgtcct tgagcttgag cctgggcttg agaggtgcca
ctcttattct ccaggccctg cccctgcccc ctcagcatgt cagacaccca ccctctagct
ggtctggcct cttgagtctg aaacccaccc ccagcccaag ccccgcctct gagccccgcc
caacccattt tccgttccca gagcatgttc tcgccaacaa tgatgtttcc tgtgaccacc
cctctaacac cgtgccctct gggagcaacc aggacctggg agctggggcc ggggaagacg
cccggtcgga tgacagcagc agccgcatca tcaatggatc cgactgcgat atgcacaccc
agccgtggca ggccgcgctg ttgctaaggc ccaaccagct ctactgcggg gcggtgttgg
tgcatccaca gtggctgctc acggccgccc actgcaggaa gaagtgagtg ggagttccaa
gaggagggtt ggtggggacg gggaagtggg ggtgggggtg gggaagtggg ggtgggggtg
tcatggaggt gagggctggt ggggacgggg aagtggggtt gggggtgtca tggaaggtga
gggttggtgg ggatgggttg gggatgtggg agcaggagga ggtcgagttg gggataggac
taaggatgga gttttgcggg ggagcaaggt gggaggatga ggttggagag gggagagtgt
tgtggtaggg aatgggaagg agccaaggat gggttggatt tggggttagg agcatatatt
tgttgaatgg tttgggatgg aggtggaatt gggattggct ttagaattgg gggtgggtga
aaatcgggct ggggtggaaa tgaagatagc atggagatag ggttgagatt gggagcagat
atagaatgaa ggatggggat tggagttttg ggtggggttg gagatggttg gatttgggct
tgagaatgca tatggtgatg gcttctgggt agggaaagaa ttagggttgg gaatgggatg
ggtttggaat tgtgactggg atggggacag gcatgggatt ggagaccaag agggagttga
ggatggtttg gggaccgggg gtggggatgg gggtggggct ggggctgggt gtggggttgg
gattggcgtt ggacgtggag atagagatca gggttggtgg tgacctgccc catcttcctc
agagttttca gagtccgtct cggccactac tccctgtcac cagtttatga atctgggcag
cagatgttcc agggggtcaa atccatcccc caccctggct actcccaccc tggccactct
aacgacctca tgctcatcaa actgaacaga agaattcgtc ccactaaaga tgtcagaccc
atcaacgtct cctctcattg tccctctgct gggacaaagt gcttggtgtc tggctggggg
acaaccaaga gcccccaagg tgagtgtcca ggttcttctt gataccgacc catctctgcc
gccttccatc tttctccact tctcattgtg ttcctgtttg acagtgcact tccctaaggt
cctccagtgc ttgaatatca gcgtgctaag tcagaaaagg tgcgaggatg cttacccgag
acagatagat gacaccatgt tctgcgccgg tgacaaagca ggtagagact cctgccaggt
gaggacacct ctctttattc agcagataca cactgagtgc caactcggta acatggagcg
ttgccaaatt ctgagaatcc agcaattgcc aagacagtca ggacccctgt tctcacagag
ctcataccct agagtagtgg tgtttagtag aaataatgct gagctgctta tgtcatttcc
agttttttag tagccacatt aaaacaggta aaaaaggctg ggcgcagtgg ctcacacctg
taatcccagc actttgggag gctgaggcag gcagatcacc tttggtcagg agtttgagac
tagcctggcc aacatggcga aactctgtct ctaaaaaaaa atacaaaaat tagcctggca
tggtggcggg cgcctgtaat ctcagctgct caggaggccg agacacaaga atcacttaaa
cccaggaggt ggaggttgca gtgagctgag atcgtgccac tcactccaac ctgggagaca
gagtgacact tttgtctcaa aaagaaaaaa aaaaacaagt aaaaaagaaa caggtgaagt
taactttaat aacccaatgt atcccaaata caatcatttc aaagtgtaat taatataaaa
caattatgaa tgagatactt tacattcttt tcttgttttc atattaagtc tttgaaagtg
agtatatatg ttatgctgac agcacatctc aatttggact agctacattt caggtgctca
gtagccacat gtggctagca gttactgtat tggatggcac ggatctagag ggaaagatca
gggctgtttt gtatggttgg gcaggttgtg cactgcataa agataccata tctaataggg
gcactccgtg ttacagatgt cagttttggc agttttcagg cgtgtggtag ttaagtgtct
tgtttcaaca aaatctgtaa tatgacagtt ttctagcaag tgctggtaaa atatcttgag
gaaggaaaag agaaatctgg taggtatttt tacaagagaa tatttaatac aggggattaa
ttgcaaagct gctggaaggg ctggaggaac aaagttaaaa aataaaaaac tctgtggtca
agaatctgca taaatagggc aatttcagag agtggtaaag gttaacccca aaataaaaca
tggttttagg atagtaaaca ataagggcca atattcaaaa aggtggtcag gggagcctcc
ttggagaggt ggcatttgag cagagaatgg atgacacaaa gaagctaaac tcgtgaagtt
taaggggaaa gaaaaggcac gtgcaaaggc cctgaggcag taaggaattt ggctgattca
aagaagaaga ggaaaccaat gcaactggag aacaaaagtg ggggcaacag tagaaagtga
cgctggaggt gtaggcaggg gcgaatgctc tgcaagtatt tcttggtcac caacacagag
cttccctatg ttctaatgga agctgtatct gttgaggaag acagaattta aaatcaaact
gttacatcaa ccagcaccct tctctgtatt caggctccca agggatctag aaggacgtaa
gttaacaagc tctcattagc agggtgtgtg tttcaacagt agttaggaag ctggggattc
aggagtactc cagtcccatg gctatgaaaa gctcccccca aattgtacaa acctgacaaa
tgcaacacct ccccagctct ccccatttct tctctgtgcc ctgggtgtgg gggggtgggt
tgcgaggggg aaaactttta acagaagaaa gcacatctcg gccgggcgtg gtggctcaca
cctgtaatcc caacactttg ggaggccgag gcgggtggat cactaggtca ggagatggag
accatcctgg ctgacacggt gaaaccctgt ctctactaaa aacacaaaaa attagccggg
cgtggtggca ggcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggcctg
aacccgggag gcggaacttg cagtgagccg aggttgcacc actgcactcc agcctgggca
acacagtgag actccgtctc aaaaaaaaaa aaagaaaaga aaagaaatca catctcattc
aagtggtggc atttaaaact atttagcctt tctgtaggca aggttagtat cttgtttttc
cagacctcaa ggtgtttttt tgtttgtttt ttcataccgg tgtgtggtct gggtgtggcc
actaaaagct acaagcaaga aataataaca actacaacaa tactaatacc aatagtataa
aaataatagc atctggctaa ttgctggaca ctgttttaag tggtttgcat gcctcagctc
attaactcat ttacctgtta ttattggccc tattttacaa acaaggagcc aaggctcaga
gcagttaact aacagcctct caaaagaaac tctgcagaga tattaaattt aaaaaataat
gagagaaatt aaaccacaag aaagttgaaa tttagaggta caggcagcta agcttgtttg
ctttgaaaca gtgtctgcta ctgggaaaaa ggcaagtctt ggctttccta ataattgata
ccaggactct gtaattcata ttttgcatgc atgtaagtaa gaaatgaagc cgggtgcaat
ggcacatgcc agtaatccca gcactctggg agactgaagt gggaagatca cttgagctca
ggagttcaag accagcctgg gcaactaaaa attaaaaaaa taaaaatact aattgttttt
attttagtag attttattca taccacttac atcattattg tagtatgtac atatttattt
cttttctttt cttttctttt cttttttgag acggagtctc gctctgtcac ccaggctgga
gtgcaatggc accatatcag ctcactgcag catgcgcctc ctgggttcaa gcatttcttc
cacctcagcc tcccaagtag ctgggataac aggcacccac caccatgcct ggctattttt
ttttttccgt agagatgggg ttccaccatg ttggccaggc tggtcttgaa ctcctgacct
ccagtgatct gcctgcctcg gcctcccaaa ttgctggtat tacaggtgtg agccaccgtg
cccaggtggg agatagacat ttctctctac ctcaaacaga ggtccactca agctactttt
cattttcttc ataaatatta gccgagtggc tattttgcac caggaatggt tccaggtgct
gtggatatgg catcaggcaa aacagaccaa aaacttcctg ccgcgtggac ctcatgttcc
ccaagtggaa gacaggcaat aaagagatag ataaatatgt agtaaattaa aaaaaaaaaa
aattagccgg gtgtggtggc ttgcacctgt agttccagct acttgggagg ctgaggtggg 
agaattgctt gagcccaaac gtttgaggct gcggtaagcc atgactgcac tgctgcactc 
cagacagcag cctgggtgac aaagcaagac gtttttgtca gaaagaaaaa aaaaagagac 
gaagggagga aggagagaga aaggaaggaa ggaaggagaa agaaaggaag gaaggagaaa 
gaaaggaagg aaggaaggag aaagaaagga agaaagagaa agaaagaaaa agaaagaaag 
aaagaagaaa gaaaagagag aggaaggaag gaaagaagga aaagagggaa aaaaatgact 
gttgaagagc agtgagtatt attataggag ggtaattata gggaggtatg gggaattgaa 
gacaggaaac acaaattagt ccaagcgaat ggatttctat tgggagtgat tctgccccta 
gaagacactg gcaataccag gagacatttt tggttgtcac aactatatgg aggggcatta 
ctggcaacta atggatagat gccaagtgtg ctgttcaaca tgctatgatg cacacggcag 
gcctccacaa caaaccatta tccagcttca gatgcccaca gtgcccagat cgaggaaccc 
tcatccaggg gctgagaacc gtatttttgc agaagggagg tataaggatg ggttggtgga 
gaatggggaa ggaaggtgtg tgtccagtaa gagaaataag gcctgcacag gctggagggg 
agagtgagag agaaagggag gcggagagat acacgatgag ggagacaggc tggaacagaa 
agtagagacg aagattcgag atgtggagag gaagggtcac agaccccccc gaaatgatgt 
gtggacaaca ggaatctgga agaggaagat ggagtggaga gtgacaaatg gggtctaaag 
gttgaacttg gaggccaggc atggtggctc acgcctgtaa tcccaacact ttggaggctg 
aggtgggcga atcacttgag gccaggagtt cgagaccagc ctggccaaca tggtgaaacc 
ccgtctctac aaaaaaaata caaaaaatta gccgggtgtg gtgatggaca cctgtagtca
cagctacttg ggaggctgag gcaggagaat tgcttgaacc cgggagatgg aggctgcagt 
gagctgaggt caggccactg cgctccaacc tgggcaacag agtaagactc catctcaaaa 
aaaaaaaagc tggatttgga gtgaaatatt aataacattc tccctctctc tccttttgcc 
tgtgtctcca tctctgtctt tttctgcatt tcttcatctc tgtactttcc atctctgtgt 
gtctgttccc atctgcttct ccatctatgg gcatctctgg gtctctcatg tctccttctg
cccactttgc cacatctctg cctctctcat gccccccttt ctctcctgca gggtgattct 
ggggggcctg tggtctgcaa tggctccctg cagggactcg tgtcctgggg agattaccct 
tgtgcccggc ccaacagacc gggtgtctac acgaacctct gcaagttcac caagtggatc 
caggaaacca tccaggccaa ctcctgagtc atcccaggac tcagcacacc ggcatcccca 
cctgctgcag ggacagccct gacactcctt tcagaccctc attccttccc agagatgttg 
agaatgttca tctctccagc ccctgacccc atgtctcctg gactcagggt ctgcttcccc 
cacattgggc tgaccgtgtc tctctagttg aaccctggga acaatttcca aaactgtcca 
gggcgggggt tgcgtctcaa tctccctggg gcactttcat cctcaagctc agggcccatc 
ccttctctgc agctctgacc caaatttagt cccagaaata aactgagaag 


SEQ ID NO. 6

hK6 amino acid

MKKLMVVLSLIAAAWAEEQNKLVHGGPCDKTSHPYQAALYTSGHLLCGGVLIHPLWVLTAAHCKKPNLQVFL
GKHNLRQRESSQEQSSVVRAVIHPDYDAASHDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHILGWG
KTADGDFPDTIQCAYIHLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNI
PCGSKEKPGVYTNVCRYTNWIQKTIQAK

SEQ ID NO. 7 
KLK6 nucleic acid

CDS 147..881

gtcgacccac gcgtccggct ggctggctcg ctctctcctg gggacacaga ggtcggcagg
cagcacacag agggacctac gggcagctgt tccttccccc gactcaagaa tccccggagg
cccggaggcc tgcagcagga gcggccatga agaagctgat ggtggtgctg agtctgattg
ctgcagcctg ggcagaggag cagaataagt tggtgcatgg cggaccctgc gacaagacat
ctcaccccta ccaagctgcc ctctacacct cgggccactt gctctgtggt ggggtcctta
tccatccact gtgggtcctc acagctgccc actgcaaaaa accgaatctt caggtcttcc
tggggaagca taaccttcgg caaagggaga gttcccagga gcagagttct gttgtccggg
ctgtgatcca ccctgactat gatgccgcca gccatgacca ggacatcatg ctgttgcgcc
tggcacgccc agccaaactc tctgaactca tccagcccct tcccctggag agggactgct
cagccaacac caccagctgc cacatcctgg gctggggcaa gacagcagat ggtgatttcc
ctgacaccat ccagtgtgca tacatccacc tggtgtcccg tgaggagtgt gagcatgcct 
accctggcca gatcacccag aacatgttgt gtgctgggga tgagaagtac gggaaggatt 
cctgccaggg tgattctggg ggtccgctgg tatgtggaga ccacctccga ggccttgtgt 
catggggtaa catcccctgt ggatcaaagg agaagccagg agtctacacc aacgtctgca
gatacacgaa ctggatccaa aaaaccattc aggccaagtg accctgacat gtgacatcta 
cctcccgacc taccacccca ctggctggtt ccagaacgtc tctcacctag accttgcctc 
ccctcctctc ctgcccagct ctgaccctga tgcttaataa acgcagcgac gtgagggtcc 
tgattctccc tggttttacc ccagctccat ccttgcatca ctggggagga cgtgatgagt
gaggacttgg gtcctcggtc ttacccccac cactaagaga atacaggaaa atcccttcta
ggcatctcct ctccccaacc cttccacacg tttgatttct tcctgcagag gcccagccac 
gtgtctggaa tcccagctcc gctgcttact gtcggtgtcc ccttgggatg tacctttctt 
cactgcagat ttctcacctg taagatgaag ataaggatga tacagtctcc ataaggcagt 
ggctgttgga aagatttaag gtttcacacc tatgacatac atggaatagc acctgggcca
ccatgcactc aataaagaat gaattttatt atgaaaaaaa aaaaaaaaaa aaaaaaaaaa
agggcggccg c 
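
The CDS 147..881 annotation of SEQ ID NO. 7 can be cross-checked against the hK6 amino acid sequence of SEQ ID NO. 6 by extracting the annotated region and translating it. The following is only a minimal sketch under stated assumptions: the helper names (clean_listing, extract_region, translate) are hypothetical and are not part of the disclosure, the standard genetic code is assumed, and coordinates are taken to be 1-based and inclusive.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_listing(text):
    """Keep letters only (lowercased); drops spaces, digits and punctuation."""
    return "".join(ch for ch in text.lower() if ch.isalpha())


def extract_region(seq, start, end):
    """Return the 1-based, inclusive region start..end of seq (assumed convention)."""
    return seq[start - 1:end]


def translate(cds):
    """Translate codon by codon and stop at the first stop codon.

    A codon containing any character other than a, c, g, t is reported as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Toy input: six leading bases, the first eight codons of the annotated CDS,
# and an artificial stop codon appended for illustration only.
toy = clean_listing("gcggcc atgaagaagc tgatggtggt gctgtga")
print(translate(extract_region(toy, 7, 33)))  # MKKLMVVL
```

Applied to the full sequence of SEQ ID NO. 7 with start 147 and end 881, the same calls would be expected to return a protein for comparison with SEQ ID NO. 6, provided the listing is free of transcription errors.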

SEQ ID NO. 8 

KLK6 nucleic acid

mRNA join(2001..2185,3084..3135,3559..3606,4346..4502,
8122..8369,9791..9927,11805..12483)

CDS join(3567..3606,4346..4502,8122..8369,9791..9927,11805..11957)
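
The join(...) notation above lists the genomic segments that are concatenated to form the transcript or coding region. As a consistency check, the summed length of the CDS segments can be compared with the cDNA annotation CDS 147..881 of SEQ ID NO. 7. The sketch below is illustrative only: the function names are hypothetical, and the coordinate strings are typed in as read from the listing with spacing normalized, which is an assumption about the intended values.

```python
import re


def parse_location(location):
    """Return (start, end) pairs from a location string.

    Accepts 'join(a..b,c..d)' as well as a single range such as '147..881'.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\s*\.\.\s*(\d+)", location)]


def spliced_length(segments):
    """Total length of 1-based, inclusive segments."""
    return sum(end - start + 1 for start, end in segments)


def splice(seq, segments):
    """Concatenate the 1-based, inclusive segments of seq in the order given."""
    return "".join(seq[start - 1:end] for start, end in segments)


# Coordinates as read from the listing (assumed values after removing stray spacing).
genomic_cds = "join(3567..3606,4346..4502,8122..8369,9791..9927,11805..11957)"
cdna_cds = "147..881"

print(spliced_length(parse_location(genomic_cds)))  # 735
print(spliced_length(parse_location(cdna_cds)))     # 735
```

Both annotations give 735 nucleotides, that is, 245 codons including the stop codon, which is consistent with the two listings describing the same coding region.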

acacttaaaa aatcttctga cttaaaaaaa aaagtatggt gattggaaaa tgtaaatgtg
catgcgtgct tggcatcaca tttcattggc caggacttcc ctggatgcta aaggtcctca 
aatgccaggc tggggggctg ggacttggtc ccaagggaga tggggaccca gggcacgtct 
gtgagaggag gggcaaggtc agcacaaggc acaggaaggt ctctctgggg caagggatac 
agagaacaga gggatcctgg tccaggtggg agaggtgcag ctctgagttg gggttgaggg
tgtgggtaca gagaggaagg gaccccccag agagaggagg cagagggata gggcctggtc
actgggttgt gcaacatcag acttgctgtc tgtgaagata gcacgtcctg agaagaaggt 
gctgaggtca gtggggacca aatgtgagag ggagcacccg gagagtatac tgaataccga 
agtagtcttc atccctggag tgatgggggg tgcacaatgc aagatgacaa ttagattcaa 
tgcaagacaa agaaaagggt tggctgggaa cagtggctca tgcctatggt cccagctcct
gggaagactg aggcgggagg gtcgcttgag cccaggaggg ttgaggctgc cacgagcaag
gatcgtgcca ctgcactcca gcctaggcga cagaacaaga ccttgtctca aaagaaaaaa 
gaactttttt ttttaagtta cctgtagtgc ccagcccaag caggtgctga gccagacttc 
attcctatca ttgtccttat tacgcagtga cttccccctc ctcatttctc tccactctgc 
cacgcacaca ccctcaccct ccagcccata ccaaccaccc caaccactgc ctgtggtttc
ccatgtgcac ccaggccagg cattttcacg gcctttcctc ctgacctacg cctggctcag
ctttctaggc ccaagttcaa agacacctcc ctaaatcttc ccagatccct ctgctactgc 
ccagcaccac catcttatca cagccccacg tcgttcccaa gtgctctccg atttctgctt 
aactccatgc ctctcgctgt gtgtccgcat ctcatcaata agtcctcaag tcctcttcca 
tcctgctagc ttcctcatcg ctcgggaatc atccccgcta cttcctgggg aaactgactc
ccttctgggc acacacagtg ctacccccgg ggaaatctaa gaagagaccc aggagaagat
aagcacggag agtcagagaa tcaaggggaa agaaagggag agaggccggg cacagtggct 
cacacctgta atccagcact ttgggaggcc aaggtgggtg gatcacctga ggtcaggagt 
ttgagaccag cctggccaac atggtgaaac ctcttcccta ctaaaaatac aaaaaacatt 
tagccgggcg tggtggtggg tgcctgtaat cccagctact tgggaagctg aggcaggaga
actgcttgaa ctcaggaggc ggaggttgca gtgaactgag atcacaccac tgcactccag
cctgagtgac agagcaagac tccgtcaaaa aaaaagaaag aaagaaagaa aagaaggaag 
gaaagaaaga aggaaggaag gaagggagga agggagagag gaagggagag aggaagggag 
agagagaaaa aaagagggag agagacacaa atacagagac tgagatggga gagagagaga 
gatggaagct ccctcccctc catggccagg gagacagatg gagcaagaga cctcaggggt
gggcagactt ggaggagaag gaccaggagg atgtggagtg ccgaaatctc cagtcagggc
caggtgggca gtcagagact gcaaaggagg actgtcagac agggacaaaa ggaagccatt 
gatgtaaccg ccctcccgcc tgcccgccgg aagagaggtt gaggccggag ctgctgggag 
catggcactg gggtgctggg aggcggacaa agcccgattg ttcctgggcc ctttccccat 
cgcgcQtggg cctgctcccc agcccggggc aggggcgggg gccagtgtgg tgacacacgc
tgtagctgtc tccccggctg gctggctcgc tctctcctgg ggacacagag gtcggcaggc
agcacacaga gggacctacg ggcaggtgtg tgagtcaccc caaccgcact gaacctgggc
aggctgcttc ccagtgccgg agggctctag agcccggagt gagggcctgc aggtccctgg
gtggcacaga gagtgctggg ggtgcaggga ggcctggggc accatctgct tgccccagag
gccggaattt gtcttcagac actttctttc tccaaaaccc ggaggtctaa ggactgagcc
gactagaact tcctctgcct cagattcagg ccccagcccc tcctccctca gacccaggag
tttaggtcct agcccctcct ccctcagacc caggagtcca agttcccacc tcctccctca
gactcaggag tccaggcccc cagcccctcc tccctcagac ccaggagtcc aagttctcac
ctcctccctc agacccagga gtccaggccc caagcccctc ctccctcaga cgcaagggtc
caggccccca gcccctcctc cctcagactc aggagtccag gcccccaagc ccctcctccc
tcagacccag gagtccaggc cctcactgca ctcagggacc agtgctccct tccctggagg
cctggtcagg ggtcaccaag agcagagcgt gggggcggga ggaatgtgtg tgggaggcct
gggtaaggag gaaaagggtg tagccagtct cctggctcag ggacctgaga gacaggggtt
aaaaggacgt tccagaagca tctggggaca gaaccagcct cttccaggga ggcctgggag
ctgggggtgt gtgtctggca gtccctgcag ccctgggctc tgcggcccct gcgtcctccg
cttggctctg ccactgcatc tgagtgtctt ctctcctcac ggctccccgc atttctaact
ctttctgcct cctcgtctca aagctgttcc ttcccccgac tcaagaatcc ccggaggccc
ggaggcctgc agcaggtgag atcacagaca tcacagaacc tgccgggtgg gcggggtggg
tggccattgc gcacagagcc aggctccgag gaaaactccc atacagagga agaacgctag
ggccccctat ggtaaccctc tcctgtcgac aggaaggcaa atcagtgccc aagaaagtag
aaagatctaa tcagaatctc accatgggtt actggaccag tggacgtagt tgaattctct
ttggcactgt tttcgtggat cctcttggaa gatgtgggct gaggaagaat aaatcaggag
gctagatggg aaggacagag gtcaaggcag gagaccatag caggccagga aggaaggaga
ggatgcagag ggagcagaca gagggatggg gggagggtcg aggcagtgac taatggacca
tgtggcttcc cctctcagga gcggccatga agaagctgat ggtggtgctg agtctgattg
ctgcaggtgg ggaaagggca tttggatggg ggaggcttgc agacagggtt gggcttgttg
atggagaaga ggctggtatt ggggatgggg atatgcacag ggttggggtg ggggagcttt
gaaatgagga agacgttggg gattaggcta agggtgggga atacagatag ggagggtggt
gggaggtggg tttgaagata tgagggtttg gggtggggtt ggctttaggg atggggatct
aaacatagaa gaggtaggag gtaggttgga aagttggaga gagcccggga ataggggata
cagttgggtt tgtaatggga atggggtaag tttgggagtg gaaatacaga gaagcttttt
ttttttgaga cagggtctca ctctgtcacc caggctggag tgtagtggca tgatccatag
ttcactgcag acttgaactc ttgggtctca agtgaccctc ccacctcagc ctcccaagta
gctgggacta caggcgtatg ccaccatacc ctgctaattt gtgtgtgtgt gtgtgtgtgt
gtgtgtgtgt gtgtgtgtgg agatgaggtc tcactgtgtt accgaggctg gtctcaaact
cctgggctca agcgatcctc ctgcctcagc tgggattaca ggcataagcc actgcacctg
accaatcttg actggagttc atgttgaggg ggatgcgctt ggtttctcca gaactcctct
ctgactcaga tcttctctcc ctcagcctgg gcagaggagc agaataagtt ggtgcatggc
ggaccctgcg acaagacatc tcacccctac caagctgccc tctacacctc gggccacttg
ctctgtggtg gggtccttat ccatccactg tgggtcctca cagctgccca ctgcaaaaaa
ccgtgagtct acactgtaaa tgaacagcag atgcgactga accctgaggg tgtcttatag
atgtcaggca ggaggtgaca taggcatccc ccccatccca gcacgaggcc atctgatagc
caggtgcatt cggctgttgc ttaattgagt acttaatgtg tgccaggccc tgcgggcata
gcagtggaaa agaaaataaa aaaaagaaaa caaaaaaaaa caagcaaaat tgctgttttc
ctgaacttac tttctaatgg gggaattgga tcatttgggg acctgcaggg cgtgatgggc
atttggattt aattctgagc acagtaggaa gccactgggc agttttgttt ttgttgtttg
tttgtttttt gagacacagt ctcgctctgt cacccaggct ggagtgtagt ggcatgatct
cagctcactg caacctctgc ctcccaggtt ccagcgattc tcctgcctca gcaccccaag
tagctgagat tacaggtgtg caccaccttg cctggctaat ttttgtatgt ttggtagaga
cggggtttca ccatgttggc caggctggtc tcgaactcct gacctcaggt gatccgcccg
cctcgccctc ccaaagagct gggattacag gcatgagcca ccaccacacc cagcctgatt
tacattttta caagcaccct ggctaccacg tggaacgtgg tctgggcaag agagagggag
ggaggcccac gtgggggctg ttgctttcat ccggcgacat aggagggtgg cttgaaccca
ggcggtcgca gtggggatgg agggatgttg aatatcttgg gatgtggaat tctgagactg
agccagcaga atctggcaac gaggaacagg agggagagga agaagcacgg ctggcttccg
tgtatttgtc ctgaacaact gggtgttttg ccacgtcttt ctctgagttg tgggagaggg
aaagagaaac aggccgggtg taggcagggg agcatctgac attttgcttt agccacgatg
agttggagat gccggggaga tgtcccagca gggaggccag ggaggactct ggagctcaga
ggagaggtca gggctggagg ttaaaatgaa ggcatcgtca gcaaacaggt gtatttaaag
ccatgggact agatgagatc atccaaaaag ctggcatagt tggaggagct ggagggccca
ggacaaaaac cctgggcgct gatcctcact agtcagattc acgacagctg ccacttgttt
gatgctaact accaatcagg tgctgagtga aaccatgtac acacctttcc tggaatgccc
accacaaggg actcttggca ccattttgca aatgaggaaa ctgaggtgca gggaaatagc
aagtgacaat ccctggggtg gttcccctga ccccaaggag accttggatg actctcacca
ccatcattca ttcctttgat gtacattgac taagagcacc tgctaagtgc cacattcgag
ttgggcagtg gagattcagc aatggatggg acacacacgt catccctgcc ctcgggagca
caaggacaga aaggtgcaga caagcaaagt gagggctggg catggtggct cacgcctgta
atcccagcac tttgggaggc cgaggtgggt ggattacctg agttcgagac cagcttggcc
aacatggctc aaccctgtct ctactgaaaa tacaaaaaat tagccaggcg tggtggtggg
cttctgtaat tccagcaact tgggaggcta aggcaggaga attgcttgaa cgtgggaggc
ggaggttgca gtgagccgag atcgcgccac tgcactccag cctgaaccac agagcgagac
tctgtctaaa aaaaaaaaaa ggaaagaaag aagcagcaaa ttgggctggc cgtggtggct
catgcctgta atcccagcac tttgggaggc cgaggcgggt ggatcactcg agcccaggag
tacaaagctg cagtgagctg tgatctacag aacaccactg cagatccagc ctgggtgaca
gagcgagacc ctgtctcaaa aaaacaaaca aacaaaagaa gcaaaccctt caaaacccca
tataattaca aattatgaag gaaaagaata cgggtaccta ctttagatgg aggagggtca
ggaaggactt tctaatgaga taaaatccaa gcggaggcat gaagatggga aaaggaatgt
tcagggcaga ggaaaggctg tgataacacc cctgaggtga gaaccgtctt gagtattctc
agaaaataaa atttcccgtt cactgggggg cagaaggtgc tgggagataa ggttggaaag
tgactacagc cagatcacac aggggctcca gtgccaagtg gaggagccca ggctttattc
ttaggacaat ggggagccat gggtgatgtc tgagcaaggg agtgactctc tgtttcagga
atatgtatca aacacctatc ctgtgccagg tgctgatcaa cgcactggag atactatatc
tgaatagaac aaaaatcccc atcttgacat cctagagctg cactgtctaa tatggtagcc
atcagccaca tatagcaaat tacattgaaa ttaatgaaat ggaaaatcca caagccacat
ttcaagtact cagcagccac ctgtagcttg tggttccccc agccacctct ggacagtgca
gatcgagatc atggcatcgt agcatttagt ggacagcatt gctctgcaag gaggagaaat
aacacaatga gtaaatattt aacaataaat atatagoagg tcggatgatt gtgataggtt
ctctggtgga acagaaagca ggggagggag ataggaattg cctactaaca ggtatttgta
ttttaattgg gcaactaagg aaggcttccc tgagaggcga catttaaagg aagtgaggga
gtgagctatg cagatacttg gaggacagac ttgctggcag agggaacagc agtgcaaagg
ccctgaggtg ggaagatcac tattgtgttc aaggcaagac agggaagcca gcgtttggct
ggagcagagg gagagaaggg gagagtggga ggagaagatg tctgtgagat gatggggcag
tgcttgcaag gcctggtgtg ccacgttgag aactttggct ttgattctga gtgagatggg
agtcatagga ggggctgagc agaggaggca caggaccaac ttacattgtt aaaatatctc
tggttgcttt gtggaggatg gactgtgggg gaccagagac agagcaggga gcccagtgag
gaggctactg ctctagttca ggtaggaagt gaaaaggcag ctcaaaccaa gatggtagcc
gtgggaaagg tgagatgtgg ccagattctg gatatgcttc agagaggcaa aaggaattct
ggacagcttg gatgtagggc atgaaataaa gagagtgaag aatagccccc aagattattc
tgaaaggatg gaattgccat ttacccagct ggggaagact gtgggaggag caggccagcg
attcatgact tcccagccct ctctgaagcc tcaactgcag cccaagggct ccaggtgaga
cccagccctc ttccttccca ggaatcttca ggtcttcctg gggaagcata accttcggca
aagggagagt tcccaggagc agagttctgt tgtccgggct gtgatccacc ctgactatga
tgccgccagc catgaccagg acatcatgct gttgcgcctg gcacgcccag ccaaactctc
tgaactcatc cagccccttc ccctggagag ggactgctca gccaacacca ccagctgcca
catcctgggc tggggcaaga cagcagatgg tcagtagtgg gaggctggtg gggagcaggc
tactggctac ttggggaagt gtgccaaagg atggggagtg ggaaaattgg tgaggggcca
tgggaagatg ggctaatggt gaggaccaat gggacagggt ttcaatggga gaaaggtcaa
gggggagggsi gagtgaattt gggagctggg ccagtgagtg aacagccaat ggaaaatgta
gaccaatggg tgaatagcat gggagagatg gaacataaga tgaaggttca ataaagaggg
aaggtcagtg gggagatgct aatcaggaag gatgtcaaag gtcaaagggg actgatcagg
attcattgaa cagcaggaag gaataatgga gaaggaactg atggaagaag agaaaccaat
aaagcacaaa agccaactga aggatgtgaa ttgagacagt gaatgggggt atagctgatg
gaagagggac taaggggaaa ggatcaatgg tccagaggag tcactagagg aaaaaacagg
tccaatagat cagcaggatc catgaaggtg ggcctgtgtg tgaagggcca ataagaaagg
tgaaccattg gatgaagggc cagtgggaag gcagagacaa tgggggagga tgcggcaagt
tagaaaagga ccaatgaggg aggtggacca ttggatgaag ggctaatagg aagggagagc
cagtggggga tggtgaggcc agttagaaaa ggaccaagga gggaagcaga ccaataggaa
gagagagcca atgagggagg gcagggccag
attggaggaa gggccaatag aaagggagga 
aggaccaatg atggaggtgg accattggat 
gagagggcat ggccagttag gaaaagacca 
caatgggcag gaagtgtcca atgaagaatg
agggcgtaac agaggaagag tcctccaggt 
gtggaagaga gaaaagtgga ggagggacct 
gactcctgga gaagagacta ttaatgagga 
aagagggacc aattaggagg cagggacgat
aggaagaggg gtgccaatag aaaagaggga
ggggggtgac tggggaggat gaggggagtg 
cccctaacag gtgatttccc tgacaccatc 
gaggagtgtg agcatgccta ccctggccag 
gagaagtacg ggaaggattc ctgccaggtg
agggacagga cgaagtcaca aaaacatggc
aagagagctt tacagagaca gatagagaca 
agagacttag ttcaacacac agagacacag 
ccagcagaga caggaagtgc agagacaagg 
gccaggagca gcggctcatg cctgtaatcc
cacctagggt caggagttcg agaccagcct
aaaatacaaa aattaggatg ggcacagtgg 
gccgaagcag gaggatcacc tggggtcagg 
aaccctatct ctactaaaaa tacaaaaatt 
cccagcacct tgggaggccg aagcaggagg
ctggccgata tggtgaaacc ctatctctac
caggcgcctg tagtcccagc tactcaggag 
ggcggaggtt gttgcagtga gtcgagatca 
aagattccgt ctcaaaaaaa aaccaaaaaa 
ctgtagtccc agctactcgg gaggctgagg 

ggctgcagtg agctgagatc acgccactgc
tctaaaaaca aaaagaacca aagagaagta 
ccttcctcaa acagagcccc cacgagtcct 
agacactagc tggggaaagg ggactccctc 
gtcatccatc caggctctcc tctttatgcc 

agaccaacca agggggagac acaggcagaa
acagggaaag cgatacatag caagttggac 
ctcaagacac gaggtggaga ggtgtccctg 
gggccctccc tacctctccc acctgggtct 
ctcctcttcc tcctcctcct cctccctcat 

gtctctacac ctctgcctct ctccacacct
tcttgctctc tatgttcctc tgcatcttgg 
ttattctctc tctaccattc tctctctgtg 
tctctctgtc cctgagtctt tctctccatc 
tctctctctg tcacacacac acacacacac 

tctgggtttc tatctgtatc tgactttctc
cgctggtatg tggagaccac ctccgaggcc 
caaaggagaa gccaggagtc tacaccaacg 
ccattcaggc caagtgaccc tgacatgtga 
ctggttccag aacgtctctc acctagacct 

ccctgatgct taataaacgc agcgacgtga
ctccatcctt gcatcactgg ggaggacgtg 
ccccaccact aagagaatac aggaaaatcc 
cacacgtttg atttcttcct gcagaggccc 
cttactgtcg gtgtcccctt gggatgtacc 

atgaagataa ggatgataca gtctccataa
cacacctatg acatacatgg aatagcacct 
ttt 



ttaggaaagg accaatgagg aaggtagacc 
tccatgaggg agggtgggga cagttagaaa 
gaagaaccaa tagaaaggaa gaaccaatgg 
atggtcacag agtgaccaat caagatgaat 
gactactgat caggaggggt acagtagagg 
caactgaaac tactgaagaa ggtgggacca 
aagagaaaag gaaaaccaat aggaaatgag 
agacagccaa tgggggggaa gaatgataga 
ggtaatgaga tgtaagaatg agagacaaac 
ccaatagagg atggaggact tataggggtt 
caaggcctgg gctgagtctg gcccatctct 
cagtgtgcat acatccacct ggtgtcccgt 
atcacccaga acatgttgtg tgctggggat 
aggtgacccg gatctgccac ttacacagcc 
cagacacagg aagagagaga cacaggccaa 
ggctgaggga gaacccaagc cttgaaaaga 
tcagggatat gcagagatat aaagacacag 
atggaggccg cgggatcaag aaccagagag 
cggcactttg ggaggccgaa gcaggaggat 
gatcaacatg gtgaaaccct atctctacta 
ctcatgcctg taatcccagc accttgggag 
agttcgagac cagcctgatc aacatggtga 
aggatgggca cagtggctca tgcctgtaat 
atcacctggg gtcaagagat tgagaccagc 
taaaaataca aaaattagct gggcctggtg 
gctgtggcag gagaatcact tgaacctgga 
tgctactgca ctccagcctg gcaacagagc 
caaaaattac gcaagcatgg tgggacacac 
ctggagaatt gcttaaaccc aggaggcaga 
actccagcct ggggacagag ccagactctg 
gtaaggaagc agatggtgtg aggggactgt 
gctcagaaac gaccaggctc tggaggaggg 
ccgaatactt taacttgggt ttcctccatt 
agaatgacta atgcactgag ggatgtgcag 
aeggagacac aggcagaaac agggacagag 
gcaaagaaag ggcaggtggg cgagactgtc 
gacagaatag tgccaggcat atctctccct 
tatcgtctcc tcctccccct cctccctctc 
catcttcttc ttttctctct ctctccatcg 
ctcagtctcc attcttaaat tgtttctctt 
cattcctatc tctgtgtctt tgagtctcct 
cctttgtgtg tcttactgtc tctctctctg 
tttcagtaag tacctctgtc cctttctacc 
acacacacac acacacacac acacacagtc 
cctctttcct gcagggtgat tctgggggtc 
ttgtgtcatg gggtaacatc ccctgtggat 
tctgcagata cacgaactgg atccaaaaaa 
catctacctc ccgacctacc accccactgg 
tgcctcccct cctctcctgc ccagctctga 
gggtcctgat tctccctggt tttaccccag 
atgagtgagg acttgggtcc tcggtcttac 
cttctaggca tctcctctcc ccaacccttc 
agccacgtgt ctggaatccc agctccgctg 
tttcttcact gcagatttct cacctgtaag 
ggcagtggct gttggaaaga tttaaggttt 
gggccaccat gcactcaata aagaatgaat 
SEQ ID NO. 9

KLK6 nucleic acid 

CDS 246..980

aggcggacaa agcccgattg ttcctgggcc ctttccccat cgcgcctggg cctgctcccc 
agcccggggc aggggcgggg gccagtgtgg tgacacacgc tgtagctgtc tccccggctg 
gctggctcgc tctctcctgg ggacacagag gtcggcaggc agcacacaga gggacctacg 

ggcagctgtt ccttcccccg actcaagaat ccccggaggc ccggaggcct gcagcaggag
cggccatgaa gaagctgatg gtggtgctga gtctgattgc tgcagcctgg gcagaggagc 
agaataagtt ggtgcatggc ggaccctgcg acaagacatc tcacccctac caagctgccc 
tctacacctc gggccacttg ctctgtggtg gggtccttat ccatccactg tgggtcctca 
cagctgccca ctgcaaaaaa ccgaatcttc aggtcttcct ggggaagcat aaccttcggc 

aaagggagag ttcccaggag cagagttctg ttgtccgggc tgtgatccac cctgactatg
atgccgccag ccatgaccag gacatcatgc tgttgcgcct ggcacgccca gccaaactct 
ctgaactcat ccagcccctt cccctggaga gggactgctc agccaacacc accagctgcc 
acatcctggg ctggggcaag acagcagatg gtgatttccc tgacaccatc cagtgtgcat 
acatccacct ggtgtcccgt gaggagtgtg agcatgccta ccctggccag atcacccaga 

acatgttgtg tgctggggat gagaagtacg ggaaggattc ctgccagggt gattctgggg
gtccgctggt atgtggagac cacctccgag gccttgtgtc atggggtaac atcccctgtg
gatcaaagga gaagccagga gtctacacca acgtctgcag atacacgaac tggatccaaa 
aaaccattca ggccaagtga ccctgacatg tgacatctac ctcccgacct accaccccac 
tggctggttc cagaacgtct ctcacctaga ccttgcctcc cctcctctcc tgcccagctc 

tgaccctgat gcttaataaa cgcagcgacg tgagggtcct gattctccct ggttttaccc
cagctccatc cttgcatcac tggggaggac gtgatgagtg aggacttggg tcctcggtct 
tacccccacc actaagagaa tacaggaaaa tcccttctag gcatctcctc tccccaaccc 
ttccacacgt ttgatttctt cctgcagagg cccagccacg tgtctggaat cccagctccg 
ctgcttactg tcggtgtccc cttgggatgt acctttcttc actgcagatt tctcacctgt 

aagatgaaga taaggatgat acagtctcca tcaggcagtg gctgttggaa agatttaaga
tttcacacct atgacataca tgggatagca cctgggccgc catgcactca ataaagaatg 
tatttt 




SEQ ID NO. 10 

hK7 amino acid

MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQ 
VALLSGNQLHCGGVLVNERWVLTAAHCKMNEYTVHLGSDTLGDRRAQRIKASKSFRHP
GYSTQTHVNDLMLVKLNSQARLSSMVKKVRLPSRCEPPGTTCTVSGWGTTTSPDVTFP
SDLMCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGL
VSWGTFPCGQPNDPGVYTQVCKFTKWINDTMKKHR 


SEQ ID NO. 11 
KLK7 nucleic acid 
CDS 16..777

ggatttccgg gctccatggc aagatccctt ctcctgcccc tgcagatcct actgctatcc
ttagccttgg aaactgcagg agaagaagcc cagggtgaca agattattga tggcgcccca 
tgtgcaagag gctcccaccc atggcaggtg gccctgctca gtggcaatca gctccactgc 
ggaggcgtcc tggtcaatga gcgctgggtg ctcactgccg cccactgcaa gatgaatgag 
tacaccgtgc acctgggcag tgatacgctg ggcgacagga gagctcagag gatcaaggcc 

tcgaagtcat tccgccaccc cggctactcc acacagaccc atgttaatga cctcatgctc
gtgaagctca atagccaggc caggctgtca tccatggtga agaaagtcag gctgccctcc

cgctgcgaac cccctggaac cacctgtact gtctccggct ggggcactac cacgagccca 

gatgtgacct ttccctctga cctcatgtgc gtggatgtca agctcatctc cccccaggac 

tgcacgaagg tttacaagga cttactggaa aattccatgc tgtgcgctgg catccccgac 

tccaagaaaa acgcctgcaa tggtgactca gggggaccgt tggtgtgcag aggtaccctg

caaggtctgg tgtcctgggg aactttccct tgcggccaac ccaatgaccc aggagtctac 

actcaagtgt gcaagttcac caagtggata aatgacacca tgaaaaagca tcgctaacgc 

cacactgagt taattaactg tgtgcttcca acagaaaatg cacaggagtg aggacgccga 

tgacctatga agtcaaattt gactttacct ttcctcaaag atatatttaa acctcatgcc 

ctgttgataa accaatcaaa ttggtaaaga cctaaaacca aaacaaataa agaaacacaa
aaccctcaa 
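
As an illustrative consistency check (not part of the original listing), the CDS annotation of SEQ ID NO. 11 can be compared with SEQ ID NO. 10. The Python sketch below assumes the standard genetic code and 1-based inclusive coordinates, and translates only the first 60-base line of SEQ ID NO. 11, starting at position 16. The helper name translate_from is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order (assumption:
# the listing uses the standard code; no alternative table is stated).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_from(seq, cds_start):
    """Translate seq from the 1-based position cds_start, ignoring whitespace.

    Trailing bases that do not fill a complete codon are dropped.
    """
    s = "".join(seq.split()).lower()
    coding = s[cds_start - 1:]
    usable = len(coding) - len(coding) % 3
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, usable, 3))


# First line of SEQ ID NO. 11 as printed in the listing (60 bases).
FIRST_LINE = "ggatttccgg gctccatggc aagatccctt ctcctgcccc tgcagatcct actgctatcc"

# The feature line gives "CDS 16..777", so translation starts at base 16.
protein_start = translate_from(FIRST_LINE, 16)
print(protein_start)
```

Under these assumptions the translated prefix is MARSLLLPLQILLLS, which agrees with the first 15 residues of SEQ ID NO. 10.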

SEQ ID NO. 12 
KLK7 nucleic acid

mRNA join(1756..1785,3179..3309,3722..3869,4566..4813,5129..5265,7362..8265)
/product="stratum corneum chymotryptic enzyme" /note="alternatively spliced"

mRNA join(1756..1785,3179..3309,3722..3869,4566..4813,5129..5265,7362..7991)
/note="alternatively spliced"

mRNA join(1821..1864,3179..3309,3722..3869,4566..4813,5129..5265,7362..8265)
/product="stratum corneum chymotryptic enzyme" /note="alternatively spliced"

mRNA join(1821..1864,3179..3309,3722..3869,4566..4813,5129..5265,7362..7991)
/note="alternatively spliced"

CDS join(3237..3309,3722..3869,4566..4813,5129..5265,7362..7517)
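
A quick arithmetic check, offered as a sketch rather than as part of the listing: assuming the location strings use 1-based inclusive coordinates, the spliced CDS of SEQ ID NO. 12 should have the same length as the cDNA CDS of SEQ ID NO. 11 (16..777). The helper names parse_segments and spliced_length are hypothetical, and the location strings are written here with ".." as the range separator.

```python
import re


def parse_segments(location):
    """Return (start, end) pairs found in a 'join(a..b,c..d)' or 'a..b' string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total length of the joined segments, treating both ends as inclusive."""
    return sum(end - start + 1 for start, end in segments)


# CDS of the genomic KLK7 sequence (SEQ ID NO. 12) and of the cDNA (SEQ ID NO. 11).
KLK7_GENOMIC_CDS = "join(3237..3309,3722..3869,4566..4813,5129..5265,7362..7517)"
KLK7_CDNA_CDS = "16..777"

genomic_len = spliced_length(parse_segments(KLK7_GENOMIC_CDS))
cdna_len = spliced_length(parse_segments(KLK7_CDNA_CDS))
print(genomic_len, cdna_len, genomic_len // 3)
```

Both lengths come to 762 nucleotides, i.e. 254 codons, which is consistent with the 253 residues of SEQ ID NO. 10 plus a stop codon.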



ggcatggtgg tgcacgcctg taatccagct actcaggact ctgaggcagg agaatcactt 

gaacacgggg gagtggaggt tgcagtgagc cgagatcgtg ccattgcact ccagcctggg 

tgacagagcc agagtccatc aaaaaaaaaa aaaaataaga aagattcttc tctcctctat 

gtgtccatgc agtctcatca tttagctacc acttgtaagt aggaacatgc catatctggt 

tttctgttcc tgctttagtt tgtaagggta atggcctcca gctccattca cgtccctaca

aaggacatga tcgtgttctt ttttatggct acgtagtatt caattgtgta tacgtaccac 

attttcttaa tccagtctat cactgatgga catttaggtt gattccctgt gtttgctgtt 

gtcaatagtt ctacaatgaa cgtacgtgtc catgtgtctt taaacagaat gatttatatt 

cctttgggta cacacactgg ggcttatgag agggtggaga gtgggaggaa ggagaggatc 

agaaaaaaat aactaatggg tactaggctt aatacctggg tgattaaata atctgtataa

caaaccccca tggcgcacgt tcacctacgc aacaaacctg cacatcctgc acatgtaccc 

ccgaactgaa aagttaaaaa aagaaaaata aatatttgct tataaattaa taaatgaagc 

cctcaaaaat gttctattag ataatgttaa gtacagacat ttttgttata aatacataat 

atacaaagaa atctatgtat aacatgatta aaatgaccat aagaacatag atcctaaaca 

tggcaaatat tagtggggtg gggttaggga aagcgttgtt tttaacttac acctctctgt

tagagttggg aatgggttca ggcgtaatta caggcacgac tgggatcagc ttggacaagt 

tcccccaggc gggccagaat taggatgtag ggtctaggcc acccctgaga gggggtgagg 

gcaagaaaat ggccccagaa gccgggcgca gtggctcacg cctgtaatcc cagcactttg 

cggggccgag gcgggcacat catgaggtca ggagatcgag accattctgg ccaacatagt 

gaaacccggt ctctactaaa aatacaaaaa ttagctggga gtggtggtgc gtgcctgtaa

tcccaggtac tcgggaggct gaggcaggag aatcacttga acctgggagg cggagctggc 

agtgagccga gatcgcgcca ccgcactcca gcctggcgat agagagagac tccatccaaa 

aaaaagaaag gaagggaggg agggaggagg gaagaaagaa agaaaaccgc cccagagaag 
gacccgagcc agagcctatt ctctgagctc agcgactgct tgaatcccgc tcctgcccct
gagacccagc gcaccgggtc cctcccccga gagcagccag gagggactgt gggaccagaa 
tgtgcggggg cgcaggagct gggcaccgcc cgtccttcgg agggagggtg gagagagagt
gcagtggtgc caattgctct cgctgcgtca gggttccaga taaccagaac cgcaaatgca
ggcgggggtg tcccagagtc ggctccgcct gcaccccagg gcgctggggc cgggcatggg
cfcggggggtg atataagagg acggcccagc agagggatga agattttgga gcccagctgt 
gtgccagccc aagtcggaac ttggatcaca tcagatcctc tcgaggtgag aagaggcttc 
atcaagggtg cacctgtagg ggagagggtg atgctggctc caagcctgac tctgctctcg 
agaggtaggg gctgcagcct agactcccgg tcctgagcag tgagggcctg gaagtctgca 

atttggggcc ttttagggaa aaacgaacta cagagtcaga agtttgggtt ccacagggaa
gggcaagatc ggagcctaga ttcctgggtc tctagggatc tgaagaacag gaattttggg 
tctgagggag gaggggctgg ggttctggac tcctgggtct gagggaggag ggcctggggg 
cctggactcc tgggtctgag ggaggagggg ctgggggtct cgactcctgg gtctgaggga 
ggaggggctg ggggcctgga ctcctgggtc tgagggagga ggggctggga cctggactcc 

taggtctgag ggaggaggag ctggggcctg gactcctggg tctgagggag gaggggctgg
ggcctggact cctgggtctg agggaggagg ggctggggcc tggactcctg ggtctgaggg 
aggaggggct ggggcctgga ctcctgggtc tgagggagga ggggctgggg cctggactcc 
tgggtctgag ggaggagggg ctggggcctg gactcctggg cctgagggag gagggactga 
gacctggact cctaggtctg agggaggagg gactgggacc tggactcctg ggtctgaggg 

aggaggagct gggggcctgg actcctgggt ctgagggagg aggggctggg gcctggactc
ctgggtctga gggaggaggg gttggggcct ggactcctga gcctgaggga ggagggactt 
ggacctggac tcctaggtct gagggaggag gagctggggg cctggactcc taggtctgag 
ggaggcgggg ctgggggcct ggactcctgg gtctgaggga ggaggggttg gggcctggac 
tcctgagcct gagggaggag ggacttggac ctggactcct aggtctgagg gaggaggagc 

tgggggcctg gactcctagg tctgagggag gaggggctgg gggcctggac tcctgggtct
gagggaggaa ggtgctaggg tctggactct tgggtatgag ggaggaggag gttaggggtc 
tggacttctg agtgtaagga aggagaggcc agagaaagga atttctgggt ctgagggagg 
aggggctggg gttctggacc cctaggtctg agggaggagg ggctggggcc tggacccctg
ggtctgaggg aggaggggct ggggccggta ctcctgggtc tgtgggggga ggggctgggg 

cctggacccc tgggtctgag tggggagggg ctgggcctga atgctttctc cttctcagct
ccagcaggag aggcccttcc tcgcctggca gcccctgagc ggctcagcag ggcaccatgg 
caagatccct tctcctgccc ctgcagatct tactgctatc cttagccttg gaaactgcag 
gagaagaagg tgaaagctgg actgggaagt ctgacctcac ctcagggccc ccactgaccc 
tctccaagga gtccctgagt cagaaccctt ccctcctcaa acagcttcca tcctgggagg 

accagactgt cggctgaagc ccccgctctt cctgcttctg ctgactcagg gggtctctgt
cccctccagg ccctgcctcc tgtgctcagg gtctctctgt ggttccccag atgagatgcg 
cctcctgggt ttctgagtgg gctccttctg tctgtctcta tccctatctc ttgctttctc 
tgtatttctc cacacatttt catctgtctc tgtccatctc tgactctggg aatccctgag 
gtgcagcctc agccttcccc taatgctagc tacccacgtg ctcctccatg tctccatcca 

gcccagggtg acaagattat tgatggcgcc ccatgtgcaa gaggctccca cccatggcag
gtggccctgc tcagtggcaa tcagctccac tgcggaggcg tcctggtcaa tgagcgctgg 
gtgctcactg ccgcccactg caagatgaag taggtgccgc ccaagtctct gctggaggtg 
caccagcgtc tccagctcgc tatgggggtg gaagggcagt ctttctgtgc ctacggctct 
attctcctct ctctgggtct ctgtcctcct ctctctgggc ctctgtaccc cctctccctg 

gggctctgtc cccctctctc cctggctctc tgtctccctc tctctgggtc tctgtccccc
tctctctgga tctctgttcc cctctctctg tgtctctgtt ccccattctc tctaggtctc 
tgttccccct cctctctctc tgggtctctg tccctctctc tctggatctc tgtccccctc
tctctctggg tctctgttcc cctctctctg ggtctctgtc ccctctcctc tctctgtgtc 
tctctccccc ctcctctctc tgtgtctctg tcccccctcc tatctctgtg tctctctccc

ccctcctctc tctgggtctc tgtccccccc tctctgggtc tctgtctccc tctctctggg
gctctgtccc cctctctctc tggatctctg ttcccctctc tctgggtctc tgtctcccct 
cctctctctg tgtctctgtc ccccctcctc tctctgggtc tctgtcccca ccccgtcccc 
caggtctttg cacaccctct ctgtcacagt gtctcttctg aatctgtgaa tgtcactcct 
cgcagtgagt acaccgtgca cctgggcagt gatacgctgg gcgacaggag agctcagagg 

atcaaggcct cgaagtcatt ccgccacccc ggctactcca cacagaccca tgttaatgac
ctcatgctcg tgaagctcaa tagccaggcc aggctgtcat ccatggtgaa gaaagtcagg 
ctgccctccc gctgcgaacc ccctggaacc acctgtactg tctccggctg gggcactacc 
acgagcccag atggtaggtg gcctcagtga cccaggagtg caggccccag ccctcctccc 
tcagacccag gagtccaggc ccccagcccc tcctccctca gacccaggag tccaggcctc 
agcccctcct ccctcagacc
gagtccagac cccagcccct 
ccctcggaac caggagcctg 
tgacagctct ccctgctcct 
gtcaagctca tctcccccca 
atgctgtgcg ctggcatccc 
ccaattcctc cccagtcctg 
agtgactggg taccaagccc 
ccacctcatt ctctgcctag 
gggatgagac agagagttta 
tgaggtgctg gcctcaggcc 
aaatcataat gcaatattta 
aaaatgtcac attttaaata 
tactagcgtg gctcagcaca 
atgcccatca tgcagtttta 
tgtgtattgc agttactgag 
gccctggtcc cggggaaaac 
cttttgtcat cccctctgtt 
catttcattc ttttcctctt 
tgttctctct ccatgccctc 
tctctctcct cccctccctc 
tctctgtctc ctctctggcc 
tcatctctct ccctcatctc 
cttctatctc tctcctctcc 
ttctcctcct ctcttccagt 
acaccttccc cccctttctc 
tctctctctc ttctcttccc 
tctccctcct tcttttccac 
tcttcctcat ctctctttgt 
ctctgtctct ccacacccat 
tgggatggtg agtgttaggg 
ggtgttcccc ttctcccctg 
acagagcccc acactcagaa 
gaaattctca ataatttttg 
aaattatgta actggtcttc 
cacagggcac gcatccaccc 
gtccttaaca ttggaaaata 
attggtctca ttggccaagg 
actagctctc ccattagtcc 
tccataatct gcaagacaaa 
gatctgaagc caaagttaat 
tggtgtgcag aggtaccctg 
ccaatgaccc aggagtctac 
tgaaaaagca tcgctaacgc 
cacaggagtg aggacgccga 
atatatttaa accaacctca 
accaaaacaa ataaagaaac 
actctcaaac actggaactg 
acaccgagac ccttattcac 
tgaaacaaaa aaaatccaaa 
acagaaatga agtgaaacca 
tctggcttgg cacaacgatg 
aatgcagtga tgcaatcttg 
gtgcttcagc ctcccaagta 
tttgtgtatt tttactagag 
tgacctcaga tgatccaccc 
accacggcca gcccacaatg 
tcagtattat tcaagaacat 
atatatatgt atgtgaccct 



caggagtcca ggcccccagc 
cctccctcag acccagcagt 
aacaacagcc cttctggtcc 
ccctgcagtg acctttccct 
ggactgcacg aaggtttaca 
cgactccaag aaaaacgcct 
ggtaccctgt ctgcatgccc 
ggccttgccc tccccccagg 
gtcaggggtg ggagtttact 
ataggggtga gaaagtgggg 
caaaccttaa gggggcacca 
aaaataaaaa taaaaactca 
aagagcaggt ggatcttact 
gcgctgtact ggcactgtct 
tgtattacat ttgatttcgt 
attttgtgcc tgaagctgat 
actctttctc tccacctcct 
tctgaacagt cttcccacat 
tgttttttct ctgtgttgag 
ctctctgctc tctgtcttct 
tctcctctcc ctgcccccct 
ctctcctctt tctctctctc 
tccttgcccc ctccttttta 
ctgccgctcc cccatctctg 
ctctctctcc tctccccacc 
tttgtctctc tcttctccct 
acaccctccc catctccctc 
ccccatctct ctgtctctct 
ctctctctcc tttccctctt 
cctccttgct cacatctgca 
atagaggaga tgggagagag 
gtgagggcca gtttcatgaa 
gggtctcaaa cttagtctaa 
aacaaagggc cctgcatttt 
accctggtct ccgagaccat 
cttggagatg atgttccttc 
aagagtgctc tgatcctgga 
gtcaaaccag tgtcttcaaa 
ccagagacaa tgagtctctt 
gaccgataac tgaggaatgt 
ctccggctct attccctcta 
caaggtctgg tgtcctgggg 
actcaagtgt gcaagttcac 
cacactgagt taattaactg 
tgacctatga agtcaaattt 
tgccctgttg ataaaccaat 
acaaaaccct cagtgctgga 
gacgttcgta cagtctttac 
cacctttgac ccagtaactc 
atgtagaaca agacttgaat 
tcaaacatgt tccaaaagta 
ttttttttct ttgagacaga 
gctcactgca acctccgcct 
cctgggacta caggtgtgca 
acagggtttc accatgttgg 
accttggcct cccaaagtgc 
atattacaaa cctattaaaa 
ttaggctata ggatgttaaa 
acccataaaa aatgaaatat 



ccctcctccc tcagacccgg 
cctgggcccc agaccctcct 
tcgcccccat cctctctgac 
ctgacctcat gtgcgtggat 
aggacttact ggaaaattcc 
gcaatgtgag accctccccc 
cagggacaga gcttgacccg 
cctggcctcc tcagcttttt 
taggggccaa tgtggccctg 
gtgggaccag ggaaggagac 
aaaacctcag tgattgagat 
tgcagaagtc catgatggac 
gaattttccc ttgccgtaag 
tcatttaaaa tgtggatacc 
taagtactgc attgaagtat 
gactdkctca cctgaccctg 
ctctgttccc tctttctggc 
ctctctttgt gacataattt 
ctagcttgct ctccctccct 
ccctctttct cttgcttctc 
gctctctctt ttttcctctc 
ccccacttct ctgtctctct 
ctgtctctct ctttctcttt 
tctttctttc tctctcttta 
cccaccccat ctctctcccc 
ctttcttctc cacccccatc 
atctctttgt ctgtctctct 
ctctccccat accctttccc 
tcttctccac ctccaactct 
ccttcagctg tcaggggatg 
atgactgtcc tagagaatag 
tgtgcaagct ctgcacggac 
tgcattcctg ctgttgtctt 
cgttttgcac caagtcctgt 
cgtgtccccc tttcctgcgc 
tcccactagc ttggagcagg 
agccccaccc cttctctgca 
ggacctagtg tgtccctagc 
ctcattggct atggtggaag 
atgagaatga gttgggcttt 
gggtgactca gggggaccgt 
aactttccct tgcggccaac 
caagtggata aatgacacca 
tgtgcttcca acagaaaatg 
gactttacct ttcctcaaag 
caaattggta aagacctaaa 
gaagagtcag tgagaccagc 
ggaagacact tggtcaacgt 
taatcttagg aagaacctac 
ttaccatgat attatttatc 
ccagatggct taaataatag 
gtctctgttg cttgggctgc 
cctgggttca agtgattctc 
ccaccacacc aggctaattt 
ccagcatggt cttgaacgcc 
tgggattaca ggcatgagcc 
atgatactta gacagaattg 
tgacaaaagg aaggacaaaa 
tcacagaatc agatctgaaa 
acacatgtcc
tttccttgaa 
tctaagattg 
ttttgagccc 
tttttcctgt
ggagggagga 
agcatctcac 
cagtagacag 
ggaagatggt 

gatgagggag
aagatcctct 
acacgccctc 
tccagatcct 
agccctgtca 

taacacagaa
agaccccccg 
tgtttctggg 
gttttagcaa 
aaaagtaaca 

ctgctctcct
gttgattttt 
ggactagctc 
ttttccttta 
aagcacctac 

aaacattgt



cagactgcat 
tgtgcacttt 
gagagaggtg 
tggtcctgcc 
ctgtaaaatg 
gagaacaggc 
gagtgacaag 
gacacagggg 
ggcacctgct 
gagtcctctg 
gtgtggtcac 
ctacctgttc 
tcccctcctt 
tattgcagaa 
aacgcaggag 
ccccaacccc 
cctgtcaagt 
cattttctct 
aaacattgca 
atacgcaagc 
tttaagtaat 
tggatcttgg 
agctcattac 
tttctagggt 



actggggtcg 
tataacatga 
acctttcagg 
ggccctgttc 
ggagagagag 
caacttcatc 
tgaggaggga 
tcccacgggg 
ccccaagaag 
tgactcagag 
acctcagacg 
ttcctgtttt 
atctcatctc 
attctgcagc 
tccaggcccc 
tcctccctca 
ttaagaatgt 
ctcttctgca 
tttgcactaa 
tacaggtaga 
tcatttttct 
cttcttgggt 
ttcccctgtc 
tattacagag 



tcatgaggtg 
aaaataaagg 
aagggagact 
cagggcatat 
gaaaggatgg 
agcgtgggaa 
ggctggcggt 
gtctgccaga 
ggagggaaag 
cctggccaca 
ctgctgaccg 
tctcccagaa 
cctctgagtc 
cgctaattct 
cagcccctcc 
gacccaggag 
caaacatttt 
aggcactcca 
gtcagcctgg 
ttggtttgca 
ttgggtaagc 
tcaaatccca 
cctgttcctt 
attcaataag 



tctccttcct 
tggggaaaaa 
agaaagaaat 
ttccatttcc 
agagaggaag 
ggggtgtgaa 
tttcagaggg 
agtaagcaaa 
gaacctcggg 
gccccagcca 
aggagccact 
ttccctcccc 
tctcctaacc 
gattctccca 
ttcctcagac 
cccaggtccc 
cgaccagtca 
acattcaatc 
agatccctgg 
atgactgaga 
agtatagtgt 
gttctagtcc 
catccttgaa 
ttaatataca 



tctgtgtact 

agtctgaaga 

atgtgcctgg 

cagatctcag 

aaggaaggga 

agtgtttctg 

attgggatga 

cagtgccgga 

aagcgggtag 

tctaacatca 

ccagcccagg 

accaagatcc 

caggcaccac

tataggaggc 

ccaggagtcc 

cagccccttc 

ttcccctgaa 

tggaatttta 

ccctggccct 

tggtactaat 

^gtagttaag 

ctacaagcta 

atgggagaaa 

gaaagtgctc 



SEQ ID NO. 13

hK8 amino acid









MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQ
PWQAALFQGQQLLCGGVLVGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPVVQ
SIPHPCYNSSDVEDHNHDLMLLQLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTV 
TSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGMVCAGSSKGADTCQGDSGGPLVC 
DGALQGITSWGSDPCGRSDKPGVYTNICRYLDWIKKIIGSKG 



SEQ ID NO. 14 
KLK8 nucleic acid

CDS 35..817



gtgaccccgc 
caagacgtgg 
ggaggacaag 

cttgttccag
tacagctgcc 
gaataaagat 
caacagcagc 
ggcatccctg 

ccagaagtgc
cactctcaac 
ggggcagatc 
gggcgattct 
ctcagacccc 

ggactggatc
ccttaataaa 



ccctggattc 
atgttcctgc 
gtgctggggg 
ggccagcaac 
cactgtaaaa 
ggcccagagc 
gatgtggagg 
gggtccaaag 
accgtctcag 
tgtgcagaag 
acagatggca 
ggaggccccc 
tgtgggaggt 
aagaagatca 
ctcacaactc 



tggaagacct 
tcttgctggg 
gtcatgagtg 
tactctgtgg 
aaccgaaata 
aagaaatacc 
accacaacca 
tgaagcccat 
gctggggcac 
taaaaatctt 
tggtctgtgc 
tggtgtgtga 
ccgacaaacc 
taggcagcaa 
tctggttc 



caccatggga 
gggagcctgg 
ccaaccccat 
cggtgtcctt 
cacagtacgc 
tgtggttcag 
tgatctgatg 
cagcctggca 
tgtcaccagt 
tccccagaag 
aggcagcagc 
tggtgcactc 
tggcgtctat 
gggctgattc 



cgcccccgac 
gcaggacact 
tcgcagcctt 
gtaggtggca 
ctgggagacc 
tccatcccac 
cttcttcaac 
gatcattgca 
ccccgagaga 
aagtgtgagg 
aaaggggctg 
cagggcatca 
accaacatct 
taggataagc 



ctcgtgcggc 
ccagggcaca 
ggcaggcggc 
actgggtcct 
acagcctaca 
acccctgcta 
tgcgtgacca 
cccagcctgg 
attttcctga 
atgcttaccc 
acacgtgcca 
catcctgggg 
gccgctacct 
actagatctc 
SEQ ID NO. 15

KLK8 nucleic acid 

CDS join(<1..39,418..712,878..>946)
Exon <1..39
Exon 418..712
Exon 878..946

tcttcggttc ccggttactg gcagcagccc cctcctccca caaaagatca ggttccaagc
ttctcctttt aaaagtactt agaattcagc ccccagctct ctcctccctc acacccagga 
atccaggccc ctagcccctc ctccctcaga cccaggagtc ctggccccta gcagccccct 
cctccctcag acccaggagt ctgggccccc agcccctcct cggtcagacc taaatcccag 
gtcccagtcc ctcctccctt agatttagga gtccaggccc ccagcctctc ctccctcaga 

cccaggaatc caggccccca gcctcctccc ctctcagaac taaaatcttg gcccccagcc
ctttatgttt cagatcgtag agtctcagca ccgagtccct cctctcccta gcctcaggag 
tctgagattc cagcccctcc tccctcaaga tttcacgttc aatcccctcc gccccttctc 
actcacaccc agtgttccag ttcccagaag ctccccaggc tctagtgcag gaggagaagg 
aggaggagca ggaggtggag attcccagtt aaaaggctcc agaatcgtgt accaggcaga 

gaactgaagt actggggcct cctccactgg gtccgaatca gtaggtgacc ccgcccctgg
attctggaag gtgaggtgca gaggtactca gatacagaca tcaggccccg gaccctcctt 
ctccagattc caggacccca gcctcagatg cccttctctg tcgagatcca gcagtctgga 
ccccggcttc ctcctctccc taatttagga gtcccagctc ccagctccct gtcccctcag 
acccagacat cgaggactcc cccctccctt ggaatgtagg aatccagtcc cccagcctcc 

tccttcctcc agagaagccc agaacagccc cagatactct cggctgcctc cccagtgccc
aaatccagaa ctgggagctc aggctcctcc ttcctgttta ccggccccgc cctctccatt 
tcccagacct caccatggga cgcccccgac ctcgtgcggc caagacgtgg atgttcctgc 
tcttgctggg gggagcctgg gcaggtgagg agggttgcgg aggcctccgg aggggaggga 
tctgaaggca gcagtggcgc tggggagtct gtgggaatgc cgcgggggtt atgtgggtgc 

gtgtgcacgg atgtgaagag tgcgatacgg tgcaggagcc tctgtgggct ttcctcaggg
tggacagagg caagaaacag gtagcagcag gtaggagtag gttccgtgat gctgtaaatt 
gtctgaatag ctacagcctt tgggggctgc ttgcttgggg gcatagattc acctgggagt 
actcggggcc tgtagactca tgtggaagca tgtgggggca ttcttgggtg tgtgactctt 
gtatgatgac acatggactg aaatgagtgt ccccgtgtgg cagcgtgtgg aagcctggac 

ctcctcacta agttgtatgc ggagaacttg ccgtgtgtcc atttgaaccc acagtggcct
tcccagacct cgcactgccc cagagggtgg cgatccaacc ctctccctcc tgctgcagga 
cactccaggg cacaggagga caaggtgctg gggggtcatg agtgccaacc ccattcgcag 
ccttggcagg cggccttgtt ccagggccag caactactct gtggcggtgt ccttgtaggt 
ggcaactggg tccttacagc tgcccactgt aaaaaaccgt gagtggatga tgggggcaga 

ggtcagctgg ggcttaagga aagagggggc tggggtttcg actcaggaag gagagagctg
aggactggac tcctgggtct gaaggaggag ggggctgggg gcaatacccc tgcctgggtc 
ccaaactatc cccaccatta caggaaatac acagtacgcc tgggagacca cagcctacag 
aataaagatg gcccagagca agaaatacct gtggttcagt ccatcccaca cccctgctac 
aacagcagcg atgtggagga ccacaaccat gatctgatgc ttcttcaact gcgtgaccag 

gcatccctgg ggtccaaagt gaagcccatc agcctggcag atcattgcac ccagcctggc
cagaagtgca ccgtctcagg ctggggcact gtcaccagtc cccgaggtag tgggcttgtc
cactaatggg agggagagga ggagctggtt ggcccagtgg aacccaagct attggcaaag 
cttggtcccc cagagggaga caaagaaggg aaagtgatca tgatgttgag attcacaagc 
aggagtcata tgagaagcct cgaagatctg actactaaca agagtggtga gagaaagaac 

caactagcag attgttaagc agggccagaa aagcccatcc tgtatggcgg agagcactac
cctatgggac ctttggttgt ataggatttt atataatctt aacacacttg ggatattttg 
gacttctcag aggcccagga aaacaggctc ctaagcacct tctcccccac ctccctcctt 
tttttttttt tttttttttt ttttgcaggg ggacacagtt tcactcaatc gcccaggctg 
gagtgcagtg gcacaatctc agctcactac aatctctgcc tcctgggttt aagcgattct 

cgaacctcag cctcccgagt agctgggatt acaggcaccc gccaccacgc gagctaattt
ttgtatttgt tgtagagaca gggtttcgcc atgttgacca ggctggtctc gaactcctga 
cctcaagtga tctgcccgcc tcggcctacc aaagtgctgg gattacaagc atgagccacc 
gcccctggcc tttttctcct tcttgaaccc aggaaacctg ggccctggtc acctcttccc 
tcagacccgg gagtctaggc ccttcttctc ccaggaccca gcagtcctag ctccctcttg

actctggacc ccaaaatctg gacctccaat gaagctgtcc ctttgggact ccagaatcca 

gagagcccct cgccttcctt cacagtgaaa acattgggac tcacctaaga gagtcaagga 

gcttctccag gaagtggcaa agtcagcatt caggtccctg cctgcctcac tcctgctctg 

aatgctttgg atgagacagt ttgcggctgt ggaaacacac gtgctcgcaa atcaagtaga

tcagttcaaa ctatggctct gccctttctc cctgggcaaa ttcctttcca tctctgagcc 

tcactttcct tatctgcaaa atgggaatca ttaaccaata gattttttag ctacgtgagg 

gttcaggtta catcagtttt cttcattgtg ttctcctagt gctcagaatg gtgcctgcta 

catggtagat gctcaataaa tatttgttga atggatgacc tgatgaataa ataatgaaat 

gaatgaatgc atctctcctt caaagcgctg ttgtgaggat taaatgagat tatgagcata

ttgcttttgg cacgtagaga tgccagatgg aagtttttcc ccctcaaaga ggctttggag 

aagtctattc ctcaaaagag gttaataaaa aagatcaatt ccaccttcaa tcattaattc 

aactcttatt tactgagcac ctagtatgcc tgaggtgctg ttgcaggcgc tggatataca 

gccatgagca aactgtacaa agtccttgtc gttatggagt tgcaagctag gtgggagaaa 

tagacaataa acaaatacac ataaaataaa acattaggca aagtgctgta aggaaatatt

gtggagcgta aaaggatagg gagtaatgga gggatggtat tttttaattc gagtggtcag 

ggaaggcttc caggaggagg tgacacttgg aaggagcagg atttagcaca gattgaggac 

ccggttgtct gagggcagga agactggaaa gagagggtca gaggtatcat aaaggggcac 

ctccaagtaa cccccagccc cttgatttga aaatggctca gggtaagaaa aaacacgtga 

gatccaaggg cccctctcta gatggagaaa gcccaatagc aacaagtaca gcttcgttta

atgtggtgag aagtgatgtc ccctgtgcac agtgtcagaa aattccccca tgcagctgga 

aaactccccc attacatcct ggaaagaaag ggggttagat ccagacaggg atggaggcaa 

^gggctgctc tctcagggaa ccttacaacc tcttccccct cagagaattt tcctgacact 

ctcaactgtg cagaagtaaa aatctttccc cagaagaagt gtgaggatgc ttacccgggg 

cagatcacag atggcatggt ctgtgcaggc agcagcaaag gggctgacac gtgccaggtg

agcaatttct gaaatccttc tcctcacaca tccctcattg ccctctcgag gttcaaggct 

tggatggggg tgggggtggt aagggagtga ccccaaagac ttggcacccg gagtgttcac 

ccctatctct acggattggg agccaggttc agagaagcca aactctctct ctgaaagtca 

cactgcatag ggattaggag cattgaattg ctgctgctgt tttctttcga acgcttaact 

acaggatggg atgcagagtt gggggctata gagggtgggg tagatgtcca gcaaggagag

agtctgaggc tagagtttag acgagttgcc tgctctctgg ttcccagcat tacatcgtgg 

aatttgtgcc gactttacca ccggcccggt ggtcagccac ttttctttga gaccgggcta 

gaatcccaag gctggttccc ttctcctgtg attggttcct tgggagacaa tggtgtcttc 

ccagttggct ggagtaaatg gtgccattga cttcagtgct tcctcaatga gctccaatcc 

ctttctcttg ggatttgtca tagatataac tctctcttct aactgcaact tcgttgttcc

tagcactttc ccctggcttt gtccacttcc tggaagcccc accacctttg ccaatgactg 

gtccctaaat acaatgcttt cttccccatt ggccaaaaat ggagtcgttt ccatcagcga 

tactgccatg aaagccagtc tctggattgt tctgtagaga tagtggtctc ttccacaaat 

atttcagcca tggtctcttg gggatatggt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 

gtgtgtgtgt gtgtccataa gaatcttgat cctttctcct atgtttggca actgcaatca

atgatgcctt cactattggc caggaacaga ggaaacttca gctcagtcct ctccaagtaa 

tccctactgt cttctccctg gattggaccc tcgagaactc tttttttttt ttttttgaga 

cagggtctcg ctctgtcccc caagctggaa tgcagtggca caatcttggc tcactgcagc 

ctctgcctcc cagttcaagc aattctccca cctcagcctc ccgagtagct gtgattacag 

gtgtgcacca ccacacccag ctagtttttg tatttttagt agagacaggg attcaccatg

ttggccaggc tggtctggaa cgcctgactt caagtgatct accgcctcgg cctcccaaag 

tgccgggatt gcaggtgtga gccaccaagc ctgtctggga tttcattctt tcccctcttc 

tgtcagtgtt ttgaccacta cccttagaca ccatgtctgt ctgtacatgg aagccccaag 

ccctgtcctg actggtctca ggggacaatg cttttacccc cattggctac aggggaccaa 

tcatgccaaa gaactggtaa aacgctggga cagcaggaaa agggacgttg tggacatctc

agatgcaagg ctgttctcat tctccctgtc tagggcgatt ctggaggccc cctggtgtgt 

gatggtgcac tccagggcat cacatcctgg ggctcagacc cctgtgggag gtccgacaaa 

cctggcgtct ataccaacat ctgccgctac ctggactgga tcaagaagat cataggcagc 

aagggctgat tctaggataa gcactagatc tcccttaata aactcacaac tctctggttc 

cttgcctgtc tctgttttgg ccctgtgggg agggctggat ggggatccgg gattgttcct

gctggcagac taggtgggga tgtgcagaaa ccaagttctc atggtcactt tggccatcac 

cactgcctaa agtgatccct cgttttctgg aagaacttgg gtaagagctt tatttcaggg 

gagaaaatac atacaaaggt cttcaaacat tcagtgggct ggtatgtgaa agacagtttt 

gaaagagttt gtgtttagtt ttcctgagca aagcatttac aagctttgga gataaaattt 
tcccccttta aaaaataatt



SEQ ID NO. 16 

hK10 amino acid

MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLD 
PEAYGAPCARGSQPWQVSLFNGLSFHCAGVLVDQSWVLTAAHCGNKPLWARVGDDHLL 
LLQGEQLRRTTRSVVHPKYHQGSGPILPRRTDEHDLMLLKLARPVVPGPRVRALQLPY 
RCAQPGDQCQVAGWGTTAARRVKYNKGLTCSSITILSPKECEVFYPGVVTNNMICAGL
DRGQDPCQSDSGGPLVCDETLQGILSWGVYPCGSAQHPAVYTQICKYMSWINKVIRSN 



SEQ ID NO. 17 

KLK10 nucleic acid

Gene 1..1580
CDS 220..1050



catcctgcca cccctagcct
tcttggcacc gggacccgga 
ggcgtttcgg gcactgggag 
gacccaggag tgccagcctc 
ctctccgccg cctctggcgc 
ctctgggccg cagaggcggc
tatggctccc cgtgcgcgcg 
tcgttccact gcgcgggtgt 
ggaaacaagc cactgtgggc 
cagctccgcc ggaccactcg 
atcctgccaa ggcgaacgga
gtgctggggc cccgcgtccg 
cagtgccagg ttgctggctg 
ctgacctgct ccagcatcac 
gtggtcacca acaacatgat 
gactctggag gccccctggt
tacccctgtg gctctgccca 
tggatcaata aagtcatacg 
gttatgctcc tgctgatcca 
tcggctgaac tctccccttg 
acatctcccc tctcacctca
aaatgcagga agtggtggca 
agcctctgag agcagttact 
gtgactttgg gcaagccaag 
aacaatgacg tgcctacctc 
gtaaatcttc atggtgattg
aaggttacct gttgtcgtga 

SEQ ID NO. 18 
KLK10 nucleic acid

Gene 1..5574
mRNA join(48..120,605..701,2455..2635,3589..3863,4195..4328,4793..5474)
CDS join(614..701,2455..2635,3589..3863,4195..4328,4793..4945)
Promoter 1..47
5'UTR join(48..120,605..613)
exon 48..120



tgctggggac 
gaatccccac 
aagcctgtat 
acccacgcag 
ccgggctctg 
gctgctcccc 
cggctcgcag 
cctggtggac 
togagtaggg 
ctctgttgtc 
tgagcacgat 
ggccctgcag 
gggcaccacg 
tatcctgagc 
atgtgctgga 
ctgtgacgag 
gcatccagct 
ctccaactga 
gatgcccaga 
tctgcactgt 
ttcccccacc 
aaggtttatt 
ggggtcaccc 
tgccctctct 
ttagacatgt 
tcatgtaagg 



gtgaaccctc 
ggaagccagt 
tccagggccc 
atcctggcca 
gcgaagctgc 
caaaacgaca 
ccctggcagg 
cagagttggg 
gatgaccacc 
catcccaagt 
ctcatgttgc 
cttccctacc 
gccgcccgga 
cctaaagagt 
ctggaccggg 
accctccaag 
gtctacaccc 
tccagatgct 
ggctccatcg 
tcaaacctct 
tatccccatt 
ccagagaagc 
aacctgactt 
gaacctcagt 
tgtgaggaga 
cttaacacag 



tccccgcgcc 
tccaaaaggg 
ctcccagagc 
tgagagctcc 
tgccgctgct 
cgcgcttgga 
tctcgctctt 
tgctgacggc 
tgctgcttct 
accaccaggg 
tgaagctggc 
gctgtgctca 
gagtgaagta 
gtgaggtctt 
gccaggaccc 
gcatcctctc 
agatctgcaa 
acgctccagc 
tccatcctct 
gccgccctcc 
ctctgcctgt 
caggaagccg 
cctctgccac 
ttcctcatct 
ctatgatata 
tgggtggtga 



tgggaagcct 
atgaaaaggg 
aggaatctgg 
gcacctccac 
gatggcgcaa 
ccccgaagcc 
caacggcctc 
cgcgcactgc 
tcagggagag 
ctcaggcccc 
caggcccgta 
gcccggagac 
caacaagggc 
ctaccctggc 
ttgccagagt 
gtggggtgtt 
atacatgtcc 
tgatccagat 
tcctccccag 
acacctctaa 
actgaagctg 
gtcatcaccc 
tccctgctgt 
gcaaaatggg 
acatgtgtat 
gttctgacta 
exon 605..701

ttggggtcaa aagggaaggt cccgccaggg gtccctgggc agaggatacc agcggcagac 
cacaggcagg gcagaggcac gtctgggtcc cctccctcct tcctatcggc gactcccagg 
tgaagctacc tgcaccccac ccgggttggg gtggattgcg agaggatggg tgggaacccc 
cgggccacag gcaggagccg gcttagagcc tcggtttctc cactgcggga cgcggaagtc 
cccccgctgt gaggttgaga aagaggctcc cactggctcc gagcctcgga tccccacccc 
gcgcgtgaag gagggggaaa cctcgggcgc ggcgtggctg cagccagggt agctggggcg 
cggagagcgc tccactcggg cacagggagg acgggaagat gccgcgaggg gcgtcattag 
ggtaattgtg cccattaccg tgttccagcc cgaatctccc gtctgccagc ccctggggtt 
atcggctgca ggtcaaaagg gtgcttgcgg gctggggtgg cccaggctgg gcggatgagc 
gcggcgaggt gggggtctct aacggagcat ctgttttaac ccgcccctgc acacacccca 
gcagatcctg gccatgagag ctccgcacct ccacctctcc gccgcctctg gcgcccgggc 
tctggcgaag ctgctgccgc tgctgatggc gcaactctgg ggtaaggtgg gggacagggg 
gcggggagag gcgccggtgg gaggcacggg cgggagggca atgtgttccc gtcaccaagc 
cccgcgcacc tctcctcccc cgagacccca gcacccaccc agcgctccgg agcacggccc 
gcccccaagt cagctgggcc cttcttctgg ctcggcccct gggtgacccg ccccactcag 
gccctgtccg atttctgcca cccggatcct cgctcttccg tggactcttc ggcgtgttct 
ttctcctcct cttacgccgc cccatcccgg ccccgctcca ttttacacta ccgtgttttg 
ttttgtttgt ttgtttgttt gagacggagt ctcgctctgt cgcccaggct ggagtgcagt 
ggcgcgatct cggctcactg caagctccgc ctcccgggtt cacaccattc tcctgcctca 
gcctcccgag tagctgggac tacaggcgcc cgccaccaag cccggctagt tttttgtatt 
tttagtagag atgggatttc accgtggtct cgatctcctg gcctcgtgat ccgcctgcct 
cggcctccca aagtgctggg attacaagcg tgagccaccg ctcccggcca caccacagtg 
ttttatcctg agtcttgcct taccgctttt tgccctctcc cctcactttt tttcttcctc 
tttccctttc tctctctttt ttctttttct ttctttcttt tctttctttc gttcttcttt 
tccttccttc ctttccttcc ttcctttcgt ttgtttcttc tttcttgttt tactttctct 
cttatttttt cttctttctt tcttgttttt tttctttctc tctctttttc tttccttctt 
tttttttttt tttttttgag acagggcagt gctctgtctc cgtggctgga gtacagtggc 
ccaatcagag ctcactgcag cctcgacctc ctgggctcaa gcgatactca gcctccagag 
tagctggtac cacaggcatg caccaccaca tccggctttt tttttttttt tttttttttt 
tttttttgag acagggtctc actctgtcgc ccagactgga gtgcagtggc ccaatctcgg 
ctcattgcaa cctccacctc ctgggctcaa gcgatcctcc cacctcagcc tcccaggtag 
ctgtgactac aggcgcatgc ctccgcgact acttttttgt tgttgttgtt gtttgtttgt 
ttttgtagag actgggtctt gctgtgttgc ccgggctggt cttgaattcc cgagctcaag 
cggtccaacc gcctcggctt cccaaagtgc tgggattaca ggcgtaaacc actgcgcccc 
acccctctcc tggttttcaa tcccgttttg ttattcacac cccttcctct ccccgatccc 
cgagttctat ccccgcaccc ttacctcccc gccgcgttca atccccgccc ctctatcgac 
cagcgacgtt ctagccagct ctccaggcgc gctgcgttca gtccctgccc tccagaccca 
ccctattctg tctcattact ccacgctacc ctatcccagc ttccttccac tttcacgcgc 
ttcttctcct cccattcctt cggtgcacgc gaaaccccca atatttccct accaccctcg 
cgttctggct gcgtcccccg tccccgaacg cagtccagtg ccacagccca gctccaaccc 
caaacccaga ccctgccctc cgtgctttga ttccgtcccc tttctttctc ccagccgcag 
aggcggcgct gctcccccaa aacgacacgc gcttggaccc cgaagcctat ggcgccccgt 
gcgcgcgcgg ctcgcagccc tggcaggtct cgctcttcaa cggcctctcg ttccactgcg 
cgggtgtcct ggtggaccag agttgggtgc tgacggccgc gcactgcgga aacaagtagg 
aggagatcca tccccgagga cgccacgggg ggctgtggag gcggcctcca gggaggagcg 
ggcagggcgg ggtctctcgg aactcccaca gctggaggtg cggtccccgg tgtccttcca 
gcaggaggag agaccgggct tgcgctgtgg ccgccagggg acggtgtggt ccttttccgc 
atttctgggc ccgtcactcc tctcccgctg attctccttg agctctagga ggaggtggtt 
gcctgcaacg gatagaagcc agggtccggt gtggtcagga atttagaact aaaggaaaag 
gttcacttcg tgagtccccg ttgaaggagg aagggttggg tattaccaca gagaaaatgt 
ggagttgggc tgggctcggt ggctcacgcc tgtaatccca gcactttggg aggccaaggc 
gggtggatca cctgaggtca ggagttcaag accagcctgg ccaacatggt gaaaccactt 
ctcaactaaa aataaaataa aataaaaaat aaaaaatttt aaaaaaatta aaaaaaaaaa 
aaggctgggt gtggtggcgg gtgcctgtaa tcccagctac tcaggaagga ggctgaggca 
ggagaactgc ttgaacctgg gaggtggagg ttgcagtgag ccgagattgc gccattgcat 
tctagcctgg gtgacaagag cgaaactctg tttcaacaaa gaaaagaaaa agagaaaaga 
aaatgtggaa ggcttaccta ggtgtccagg cccccagccc tcctcaattc ctgcagatcc 
tcagagctca aacaactgat tcctcctccc catgtccact gaggtcccct tctcccacaa 
ggccctcttc cctcagactc ttcctatctc caggccctgc ttcactgccc acctgctttc 
ccagtccctg tgaagggttt gccttcacat gcctcttcct tcccccaggc cactgtgggc 
tcgagtaggg gatgaccacc tgctgcttct tcagggcgag cagctccgcc ggacgactcg 
ctctgttgtc catcccaagt accaccaggg ctcaggcccc atcctgccaa ggcgaacgga 
tgagcacgat ctcatgttgc taaagctggc caggcccgta gtgccggggc cccgcgtccg 
ggccctgcag cttccctacc gctgtgctca gcccggagac cagtgccagg ttgctggctg 
gggcaccacg gccgcccgga gaggcaagag ctggggctct gaggccagaa cctcaggagg 
agggggctga gggcctgaac ccctgggtct gaggaaggat gggctgggga ctggattcct 
ggatctgagg gaggacgggc tggggtccta gatgcctggg tctgtgagtc tgaggggagg 
aggggctggg ggcctggact cctgggtcta agtggggagg ggctggggcc aggattcttg 
agtctgaagg aggaggggct ggggcttagg atagaaacgg tcttgtatct ggactcctgg 
ctccccaagg attgggggct ggacccaggg attactggca tattctccct tcagtgaagt 
acaacaaggg cctgacctgc tccagcatca ctatcctgag ccctaaagag tgtgaggtct 
tctaccctgg cgtggtcacc aacaacatga tatgtgctgg actggaccgg ggccaggacc 
cttgccaggt agggtctgaa cagggagagt ctctgactcc tgggagggag gacagggagg 
ttatgggaaa agagcagacc ctgtgcccga tcccaaactc cattcccaaa cccatccttg 
accccaactc ttacccagac ctaaccccct cctcatccct atcctcaatc ccatttccat 
cctaacccca ccccattccc atctccaagc ccattttcat cccctcacct tccatgaact 
acaatcccaa cccaagtctc actgtgcctt cattctcatc ccccagccca acctcccata 
acctgaagtc cacctccatt cctaccttcc agctcatacc taattccaac cccatcccat 
cctcgtcttt atcccaaccc aaccccttcc ttccccacca ctgccccaga tcccaaagtg 
acagctctca cgttggcaca tttatttgat ctctcctttc tgccaccccc agagtgactc 
tggaggcccc ctggtctgtg acgagaccct ccaaggcatc ctctcgtggg gtgtttaccc 
ctgtggctct gcccagcatc cagctgtcta cacccagatc tgcaaataca tgtcctggat 
caataaagtc atacgctcca actgatccag atgctacgct ccagctgatc cagatgttat 
gctcctgctg atccagatgc ccagaggctc catcgtccat cctcttcctc cccagtcggc 
tgaactctcc ccttgtctgc actgttcaaa cctctgccgc cctccacacc tctaaacatc 
tcccctctca cctcattccc ccacctatcc ccattctctg cctgtactga agctgaaatg 
caggaagtgg tggcaaaggt ttattccaga gaagccagga agccggtcat cacccagcct 
ctgagagcag ttactggggt cacccaacct gacttcctct gccactcccc gctgtgtgac 
tttgggcaag ccaagtgccc tctctgaacc tcagtttcct catctgcaaa atgggaacaa 
tgacgtgcct acctcttaga catgttgtga ggagactatg atataacatg tgtatgtaaa 
tcttcatgtg attgtcatgt aaggcttaac acagtgggtg gtgagttctg actaaaggtt 
acctgttgtc gtgatctgac cacgtcccgg tgaaagcgtg tgtccaggga agaagtgcac 
agggtagccc ccaatcccaa ccttccatcc ccaaccctta gggatgatgg aaga 



SEQ ID NO. 19 

hK11 amino acid 

MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMRILQLILLALA 
TGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHCLKPRYIV 
HLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITWAVRP 
LTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTM 
VCASVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQETMKNN 

SEQ ID NO. 20 

hK11 amino acid 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTR 
LLCGATLIAPRWLLTAAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSL 
PNKDHRNDIMLVKMASPVSITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHT 
LRCANITIIEHQKCENAYPGNITDTMVCASVQEGGKDSCQGDSGGPLVCNQSLQGIIS 
WGQDPCAITRKPGVYTKVCKYVDWIQETMKNN 

SEQ ID NO. 21 
KLK11 nucleic acid 

aggaatctgc gctcgggttc cgcagatgca gaggttgagg tggctgcggg actggaagtc 61 
atcgggcaga ggtctcacag cagccaagga acctggggcc cgctcctccc ccctccaggc 121 
catgaggatt ctgcagttaa tcctgcttgc tctggcaaca gggcttgtag ggggagagac 181 
caggatcatc aaggggttcg agtgcaagcc tcactcccag ccctggcagg cagccctgtt 241 
cgagaagacg cggctactct gtggggcgac gctcatcgcc cccagatggc tcctgacagc 301 
agcccactgc ctcaagcccc gctacatagt tcacctgggg cagcacaacc tccagaagga 361 
ggagggctgt gagcagaccc ggacagccac tgagtccttc ccccaccccg gcttcaacaa 421 
cagcctcccc aacaaagacc accgcaatga catcatgctg gtgaagatgg catcgccagt 481 
ctccatcacc tgggctgtgc gacccctcac cctctcctca cgctgtgtca ctgctggcac 541 
cagctgcctc atttccggct ggggcagcac gtccagcccc cagttacgcc tgcctcacac 601 
cttgcgatgc gccaacatca ccatcattga gcaccagaag tgtgagaacg cctaccccgg 661 
caacatcaca gacaccatgg tgtgtgccag cgtgcaggaa gggggcaagg actcctgcca 721 
gggtgactcc gggggccctc tggtctgtaa ccagtctctt caaggcatta tctcctgggg 781 
ccaggatccg tgtgcgatca cccgaaagcc tggtgtctac acgaaagtct gcaaatatgt 841 
ggactggatc caggagacga tgaagaacaa ttagactgga cccacccacc acagcccatc 901 
accctccatt tccacttggt gtttggttcc tgttcactct gttaataaga aaccctaagc 961 
caagaccctc tgcgaacatt ctttgggcct cctggactac aggagatgct gtcacttaat 1021 
aatcaacctg gggttcgaaa tcagtgagac ctggattcaa attctgcctt gaaatattgt 1081 
gactctggga atgacaacac ctggtttgtt ctctgttgta tccccagccc caaagacagc 1141 
tcctggccat atatcaaggt ttcaataaat atttgctaaa tgagtg 

SEQ ID NO. 22 
KLK11 nucleic acid 

gene 2313..7622 

mRNA join(2313..2398,4189..4263,5061..5217,5545..5810, 
6627..6763,7158..7622) 

CDS join(4224..4263,5061..5217,5545..5810,6627..6763,7158..7310) 



tgataatagt gttctctctc ctcattggtc agggccccag ccattgtcct tgagagaatg 61 
ctcgactctt tatgttgtct tgacagcctc ccctgagatt ggtcattaat gactgtgctc 121 
tctctcctca ttggtcaggg ccccagccat tgtccttgag agaacctctg tcctttatgg 181 
agttccaccc ttcttccctg ggattggccc ctagagacag tggttcttct cttttggtta 241 
gccattgcca ttgtcctccg ggaaagtgat tatactcttt tgtctaatga ccagacttgg 301 
agccctcccc aaggcccagg actgggttga agggttgggg aggaaaacag aaataagatg 361 
tctcccttgt tcagacagta cttctcttcc cttccagggt gattctgggg gccccctggt 421 
gtgtggggga gtccttcaag gtctggtgtc ctgggggtct gtggggccct gtggacaaga 481 
tggcatccct ggagtctaca cctatatttg caagtatgtg gactggatcc ggatgatcat 541 
gaggaacaac tgacctgttt cctccacctc cacccccacc ccttaacttg ggtacccctc 601 
tggccctcag agcaccaata tctcctccat cacttcccct agctccactc ttgttggcct 661 
gggaacttct tggaacttta actcctgcca gcccttctaa gacccacgag cggggtgaga 721 
gaagtgtgca atagtctgga ataaatataa atgaaggagg ggccatgtct gtccatttga 781 
agtcctcatg ctggttgaga ctggaagaag gactcagcag tttccctatc tcataggagt 841 
agaaacagag ctcaaataag gccaggcaca gtggctcaca cctgtaatcc catcactttg 901 
ggaagctgag gcaggtggat cacctgaggt caggaactcg ggaccagcct ggtcaacata 961 
gtgaaacccc aactctacta aaaatgcaaa aattagccag gcatggtggc gcatgcctgt 1021 
aatcccagct actcaggagg ctgagacagg agaatagcat gaacccgtga ggcagaggct 1081 
gcagcgagcc gagattgaac cattacactc cagcctgggc gacagagcga gactccatct 1141 
caaaaacaaa caaacaaaaa acccagtgct caaataggat gagggtcttc cctgagtagt 1201 
tactcagaaa tggagtagaa aaagttactt ttaataatat aggccgggtg cagtggccca 1261 
cgcctgtaat cccagcactt tgggaggccg aggtgggagg atggcttgag ctcagatttc 1321 
gagatcagcc tggcaacaca gtgaaatctt gtcactacaa aaacacaaaa aattagctgg 1381 
gtgtggtggt gcgtgcctgt agtcccagct acttgggaag ctgaggtggg aggatcaccc 1441 
gagccgggga ggtggaggct gcaaagagcc gagatcatgc cactgcactc cagcctgggc 1501 
aataaagtga gaccttgtct caaaaacaaa aacccagcaa tataaataag acacatgttt 1561 
cttcatctgg cataatagaa atagtgccca gagcttataa gcttttcaag agtccacaaa 1621 
agacccgaaa aagaaaaaga aaattgttag ctccaaaata ccagatgaaa gctgcaaagt 1681 
caacatttat gaccatttaa tccaatgtcc ataaaacgta gcattctttc cactagccaa 1741 
ctgcagttta ctttcttgta atgaagcata cattgtatct ttaatgtggg acgtggcttt 1801 
gttctaataa gacgaagggt ggagtgcagg cttggaaagc aggagagctc agcctacgtc 1861 
tttaatcctc ctgcccaccc cttggattct gtctccactg ggactcaaga ggtgaggaga 1921 
gaccatctcc ccaaatgcac tgaagggaaa ctggaggagg gagggagtga ggggtgatca 1981 
taccagcgga ggcacatttg ctgagccccc ccgcagtctg ctctttccaa gtggaccctc 2041 
ctggaagcct gatcccaacc tcccctgcaa gcaggtctgt cacccccatc tctcagatga 2101 
agaaactgag ccttgcaggg gtggagtccc ttgtccccac gtcataaggg tagtcatagt 2161 
agtaggaaga ggaagcacct aggtttgagg ccagggctgg ctgctgtcag aacctaggcc 2221 
ctcccctgcc ttgctccaca cctggtcagg ggagagaggg gaggaaagcc aagggaaggg 2281 
acctaactga aaacaaacaa gctgggagaa gcaggaatct gcgctcgggt tccgcagatg 2341 
cagaggttga ggtggctgcg ggactggaag tcatcgggca gaggtctcac agcagccagt 2401 
aagtgaacag ctggactcgg gctgcctggg cggcagggag aagcgggcag gggaagggtc 2461 
agcagaggag cgaggcccca gaggagccct ggggtggagc acagccaagg gctctgttcc 2521 
ctttcctgga ctcggcttcc acaggcGctg acctgcctcc cccaccctcc ggtcctgccc 2581 
ctgtgcctgg cagcagcccc acctgtgtga catcccagca caccccccct ctccttgcaa 2641 
aggagaaggg agcggcctag gggaggccag gggcccacct gggctggggc tgtggagagg 2701 
gagtggctgg gacgggagga aaaagagaga cggagattag atggaagaag agggatttca 2761 
agacaaattg ccagagatgc agtcagagag actgactgag agacacaaag atagaaggaa 2821 
ttagagaaag ggccacacag agccagacag agagagaaga gtggagatgg agacagggac 2881 
gaggacagag aaaggcagac agacacatag ggacagaaag agaaaaatca cacaaagtca 2941 
gaattactga atgacaggga atgacacata gaacgagaca cagattcaga gactcagggc 3001 
agggaaagga aggctgcaga cagacagaca gacagaggga ggctgagaca cagggagaag 3061 
aggggcttgg agaggtggca caggcaggca gccagtgcct cagaggcctc cggggagggc 3121 
cctcacacac accccgcccc ggggcattaa ggcagggctt ggaggccagt catcctgggc 3181 
ccgcccaggg ccgcccccct gccagcccgc ctgcctggtg cctggcacct ggcgctccaa 3241 
cccagcctac ctgctgtagc tgccgccact gccgtctccg ccgccactgg gcccccagag 3301 
ccccagcccc agagcctgtg agtccaggag gaaagggaag ctgcccctcc ccgtccaggt 3361 
gtcagccctc cccaaggaca cctgtcccac tcgggcaccc atttctccct ctgctctgtc 3421 
cttctctgct tgggtggggg ttcctggcct ctctctacac ctctcacctc cgatggctgt 3481 
ccgcagcctc agttacctct aatctccatg gcttcagctg ctgagctggc cctctgctcc 3541 
cacccccgct ggccagggca gcggagggca ctggccctcc cctcgaccag ccccgcccag 3601 
ctttgcttgg ctgtccttca aaagggcagg ggtttggcgg acagggcttc agcaagccgg 3661 
gtgatggggg tcccagacat tgtctggggc tgagccccct actcccctcc agcagacctc 3721 
aaaggctcca tatcgctctg ctgcgaagac aatgaaaaag gggtggctac ggaacggtgt 3781 
ctggttcccc ttgtccttcc accccaagct gctggggcct ggccagctct caaggcaaga 3841 
aggaaaacat cctctgacat gtgccgggga ggtcccatgg ctgacttgaa cagggccgaa 3901 
ccatggcttg acagctcaaa gcccctccca acgacttcca catggttctt ggtatctcgg 3961 
aagcttctag ctgtgaccag gccctctcca aggccaccct agacacctaa gatatatttt 4021 
aagtgtttgg agatctgagt gctgtgagaa acaggggatt tccccaacct tgtttctccc 4081 
aagtggggag cgggagcagg tgagggagag aggagagggc atgagccagc ccccccctcc 4141 
cgatttcccc gtaaagtgat gcggccccat gtccctcctt gttcccagag gaacctgggg 4201 
cccgctcctc ccccctccag gccatgagga ttctgcagtt aatcctgctt gctctggcaa 4261 
caggtacgca ggggatgggg gcagggcagg atcctccctc ttgaatctct gggatcccct 4321 
aaccctctgt gtctggacag tgacagggct gattccaaat tacagaacaa cccataaggc 4381 
acctgaactg gagcagtggt catgagggcc tggatgccct tctagataat ccctttaaat 4441 
gccaaaggag gagaggtcaa gggggtcgta aagggtcccg tggaggggct gaggaagctg 4501 
gagttggggg agcagtcact caaagcgccc aggacagggg ctactgacca accagtatgg 4561 
aagtatttcc tttttttttt ttcccagaga caaagtcttg ctctattgtc caggctggag 4621 
tgccgtggtg ccaacacggc tcactgcagt cttgacttcc cgggcttaag tgatccttaa 4681 
gccatctcag cttccccggt agctgggacc acaggcacct gccaccaagc caggctaatt 4741 
gtttaattgt ttgtagagat gggggaggag gtctcactat gtttgcctag gctgatctag 4801 
aactcctggg ctcaagtaat cctcccacct tagcctctca aagtgctggg attacaggca 4861 
tgagccactg catttgacct tatggaagta ttttcatcct ttaatacccg accccagcat 4921 
ccagggcaac ccagagggac accagaccag ggcccagacc acccactctc tttctctcct 4981 
ccccaccccc atttctggga gtcctcctgg tctaccacct ctccttcctg agccccttct 5041 
tttgctctca ccccctccag ggcttgtagg gggagagacc aggatcatca aggggttcga 5101 
gtgcaagcct cactcccagc cctggcaggc agccctgttc gagaagacgc ggctactctg 5161 
tggggcgacg ctcatcgccc ccagatggct cctgacagca gcccactgcc tcaagccgtg 5221 
ggtgcggggg ctggggcggt gccggggtgg ggggctggga atggggagat ggatggagag 5281 
aagctcaggg ataggggtgc tggtaagggg attagagatg gggatgggta gtgtcagcaa 5341 
ggttgatggg ctcgagttgg tattgaaggt ggggggatga atggggttgg gatggggcta 5401 
tggctgggaa gggggcttcg gtgggagacg tggaagaggt tggaagcaga gcgatgtttc 5461 
ttcatcctca aaggtgtcac tcacctctcc cacccatgtc tcccccgacc tttcctcctc 5521 
caactactgt ctctcccacc tcagccgcta catagttcac ctggggcagc acaacctcca 5581 
gaaggaggag ggctgtgagc agacccggac agccactgag tccttccccc accccggctt 5641 
caacaacagc ctccccaaca aagaccaccg caatgacatc atgctggtga agatggcatc 5701 
gccagtctcc atcacctggg ctgtgcgacc cctcaccctc tcctcacgct gtgtcactgc 5761 
tggcaccagc tgcctcattt ccggctgggg cagcacgtcc agcccccagt gtaggagcac 5821 
cagaggggaa cctggcaggg ggtggtgagg agggagtggt caggattgtg gaagggttca 5881 
gggcatcaga gatgcggttc acagtgacga tgtgggataa gttgagagga tgtgtggaaa 5941 
acgtcaggat aggggggtgg ggacaaaagt tggggccttg gagtcagacg gacgggatat 6001 
gcaatcatac atccataacc tcctggttgt aagaccttag gcaagcagct tcacctctct 6061 
gaatcttgat tttcttctct ataaaatgag aatgattata cccacctgtc aggattggat 6121 
tagagataat gtatatcaag caactgacat aaatcattta ttggatagca ggctgggcac 6181 
cgtggctcac gcctgtaatc ccagcacttt gggaggccga ggtgggaaga tcacctgagg 6241 
tcaggacttt gataccagcc tggccaacgt ggtgaaatcc catctctact aaaaatgtga 6301 
aaattagttg ggcgtggttg tgtgcgcctg taatcccagc tactcgggag gttgaggcag 6361 
gagaatcgct taaacttggg agacggaggt tgcagtgagc caagatcacg ccactgcact 6421 
ccagcctggg caacagagca agactctgtc tcgaaaaaaa aaaaaaaaaa gctggatagc 6481 
attgctgttg ctattgttac aagaagagag gtgagttggc tgcgtctaag gacagggatt 6541 
cccccagggg cgggatcaca gcaagcactg cattagggcra ggtggcaggg ggctcattcc 6601 
cacagcccct cacgctgttt ccacagtacg cctgcctcac accttgcgat gcgccaacat 6661 
caccatcatt gagcaccaga agtgtgagaa cgcctacccc ggcaacatca cagacaccat 6721 
ggtgtgtgcc agcgtgcagg aagggggcaa ggactcctgc caggtcagtg tggtctccaa 6781 
ccacagcccc atccccatcc ccagcttcaa tgacatcttt accgacatcc acaatttcat 6841 
ccccaacctc aacccgccga cccctgcaac tcccaatcca tctcttcccc tgttcccgtt 6901 
tctgacctca gcacaaactt cagctccatc cccgtttcca caccatttcc agctccaacc 6961 
atccccaaac tcgtttttga gcctaacccc atcctttatc ccacccataa tcccagcttt 7021 
atcgctaaac ctatcacctt tcccagtgcc tacccatcct gtctcggccc cactcctaag 7081 
caccgtcccc acctcctccc tggctaacac catgctcaac gctttctctg accgacattc 7141 
tctctccccg tgcccagggt gactccgggg gccctctggt ctgtaaccag tctcttcaag 7201 
gcattatctc ctggggccag gatccgtgtg cgatcacccg aaagcctggt gtctacacga 7261 
aagtctgcaa atatgtggac tggatccagg agacgatgaa gaacaattag actggaccca 7321 
cccaccacag cccatcaccc tccatttcca cttggtgttt ggttcctgtt cactctgtta 7381 
ataagaaacc ctaagccaag accctctacg aacattcttt gggcctcctg gactacagga 7441 
gatgctgtca cttaataatc aacctggggt tcgaaatcag tgagacctgg attcaaattc 7501 
tgccttgaaa tattgtgact ctgggaatga caacacctgg tttgttctct gttgtatccc 7561 
cagccccaaa gacagctcct ggccatatat caaggtttca ataaatattt gctaaatgag 7621 
tgaatctact gagtgcttac tatgtgctag accctgatcc aatggctttt attttatttt 7681 
attttttgac agagtctcgc tctgtcaccc aggctggagt acagtggtgc tatctctgct 7741 
cactgcaacc tccacctcct gggttcaagc aattctcctg cctcagcctc ctgaatagct 7801 
gggattacag gtgcctacca ccacatccgg ctaatttttg tattttttag tagagatggg 7861 
gcttcaccat gttggccagg ctggtctcga actcctgacc tcagatgatc tgccctcctt 7921 
ggcctcccaa actcctggga ttacagacgt gagccaccgc gcccgcccgg ctttcattta 7981 
ttaattaaaa gaaattaaat taattaatct atttaggaga cagtcttgct ctgttgccca 8041 
ggctggagtg cagtaacaat cacagctcac ggcaatctca atttcctggg gtcaagtgat 8101 
tgtcctccct cagcctccag agtagctggg actacaggca catgccacga agcccagcta 8161 
atttttgtat ttttcgtaga gacagaggtc tcagtatgtt gccccggcta gtctcaaact 8221 
cctgggctca agcagtctgt cctcctcagc ctccaaaagt ggtgagatta caggcatgag 8281 
tcgctgtgcc tggcctccaa gcactttcaa atgtatcaac ttaatcctca caaaaccctg 8341 
tgaggtcggt actgttttca tacctatttt atagttgaag aaacagacac agagaagcaa 8401 
agtcacttgc tcacagtcac gtggctagga gagcaaggat ctgaagcaag gcgatctctt 8461 
aattaccaag tgatgttcct ggagtaaggc tctgtttgtt tcctttcctg taaaatgctg 8521 
catgcaaaag tataacacag taagtaaaga agtcagttag cctgcacata ctaagaccta 8581 
accaaaggag cttattgttt ttctccaact tccatgatag gtaattagat agtggagacc 8641 
tctgctggcc aatatggtag ccactaaccg cagctggctc ttccaattaa aattacataa 8701 
agccagaaat gtaactcctc tgtctcactt gttatatctc caaggctgga tagccacatg 8761 
tgactggtgg tggctggatt agctagtgca tataaaacat cactgcagaa agttcagctg 8821 
agcagcactg agttagatgg cctctgaaga ggatgtccca cggagagaat ccagaactca 8881 
ggatcttttt tttttttttt ctttgcgaca gagtcttgct ctgtcaccca ggctggagtg 8941 
cagtggcgtg atctcggctc actgcaactt ctgcctccca ggttcaagca attctcctgc 9001 
ctcagcctcc ctagtagctg ggactacagg cctgtgccaa catccccagc taatttttgt 9061 
gtctttttag tagagatggg gtttcactat gttggccagg ctggtctcga actcctgacc 



